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SURVEY  OF  MATHEMATICAL  PROGRAMMING 


by 

R.  M.  THRALL 


at  the 

American  Mathematical  Society  Summer  Seminar 

on  the 

Mathematics  of  the  Decision  Sciences 
Stanford  University 
July  -  August  1967 


1.  Introduction 


Optimization  problems  have  been  part  of  mathematics  from  its 
earliest  days.  The  general  constrained  optimization  problem  can  be  writ¬ 
ten  in  form 

min  q?(X)  for  X  c  P 

where  P  is  some  set  and  9  Is  a  (real  valued)  scalar  function  whose 
domain  contains  P.  Linear  programming  refers  to  the  special  case  in 
which  P  is  a  polyhedral  subset  of  a  vector  space  and  9  is  linear. 

Let  A  =  ]  |  |  |  be  a  p-by-m  matrix,  let  X  *  I!  II, 

C  -  II  cill  be  m-by-1  vectors,  let  B  be  a  p-by-1  vector  and  let  d 
be  a  scalar.  [In  these  lectures  all  scalars  will  be  real  numbers.] 

Then  the  linear  program 

(1.1)  min  Z  -  d  +  CTX 

subject  to  the  constraints 

AX  »  B 
X  >  0 

is  said  to  be  in  standard  form.  We  sometimes  call  A  the  coefficient 
matrix,  X  the  activity  vector,  C  the  cost  vector,  B  the  constraint  vec¬ 
tor,  and  d  the  fixed  cost  or  initial  cost.  Then 

p  =  {x|ax«b,  x  £  o  } 

is  called  the  set  of  feasible  vectors.  If  P  is  empty, the  problem  is  said 
to  be  infeasible. 

A  problem  in  standard  form  can  be  presented  as  the  tableau 

matrix 


(1.2) 

-d 

C* 

M  - 

B 

A 

(  1  ) 


or  in  more  detail 


(  2  ) 


(1.3) 


1 

=  X1  xj  xk 

-d 

c,  •••  c.  •••  c,  •••  c_ 

1  J  k  n 

bl 

an***  aij***  aik***  am 

bi 

an**#  aiJ* **  aik#"  am 

bh 

“hi***  ahj** *  ahk*"  *hn 
*  *  •  * 

*  »  »  • 

bP 

apl“*  apJ*“  apk*  *  *  apn 

Hie  zeroth  (top)  row  is  read  as 

Z  -  d  -  C,Xj  +  .  .  .  ♦  C  V 

and  the  1-th  row  as 

bi  ‘allxl+  ‘  ‘  *  ainxn  (  1  "  *  *  •>  P  )* 

As  a  notational  convierce  we  sometimes  write 

(l.k)  ■  -d.  «0J  ■  'j  <  J  ■  l.  •  •  ••  n).  *l0  *  \  *  !•  •  •  ->P> 

For  example. 


0 

-8 

•19 

-7 

o 

0 

25 

5 

it 

1 

1 

0 

50 

1 

5 

3 

0 

1 

describes  a  program  in  standard  fora^having  p  ■  2  and  a  »  5. 

The  basic  computational  step  Involved  In  the  simplex  algor* 
ltha  for  solving  linear  programming  problems  is  the  pivot  operation  of 
Gaussian  elimination.  To  pivot  on  a  position  (h,k)  we  must  have  the 
pivot  element  a^  different  from  zero.  The  pivot  operation  has  two 

steps: 

normalize:  Divide  the  pivot  row  (row  h)  by  the  pivot  element; 
Swtep  out:  Subtract  suitable  multiples  of  the  pivot  row  from 
the  remaining  rows  so  ae  to  obtain  zeros  in  all  positions  of  the  pivot 
column  (column  k)  except  for  the  1  in  the  pivot  position  (h,k). 


Uius  after  pivoting  on  position  (h,k)  the  matrix  M  of  (1.2) 
(or  (1*3)  becomes 


J  k 


“ij  *  aij“*ikah</ahk  ^  *0,1,. 

For  example ,  if  we  pivot  on  position  (l,2)  in  (1.3)  we  get 


475/4  25/4 

0 

-9/4 

19/4 

0 

25/4  3  A 

1 

1/4 

1/4 

0 

125  A  -5/4 

0 

9/4 

-3/4 

1 

Clearly,  pivoting  on  position  (h«k)  represents  solving  equation 


b  for  and  using  this  equation  to  eliminate  from  the  remelnli* 
equations. 

Leva  1A.  Lst 


Is  a  non -singular  (p  +  l)  -  by  -  (p  +  1)  matrix  differing  from  Identity 
only  in  column  h,  l.e. 


(1.12) 


^oh 

qih 


*hh 


(Kronecher's  delta  function) 
,1  =  0,1, ... ,p;  j  ^  k 

*cb/ahk 

*aik/ahk  1  *  x>  •••>  PJ  i  +  k 


Moreover,  the  problems  described  by  M  and  M*  have  the  same  feasible  sets 
P  and  P*  and  for  any  feasible  vector  X  we  have 

d*  +  c*TX  -  d  +  cTX. 


Theorem  JLB.  Let  M*  be  obtained  from  M  by  a  sequence  of  pivot  operations 
on  positions  not  in  the  zeroth  row  or  column.  Then  (1.10)  holds  for  a 
nonsingular  matrix  Q  having  the  form 


R 

P 


l.e..  the  efefro-tft  column  of  Q  is  the  Initial  unit  vector.  The  linear  pro¬ 
grams  defined  by  M*  and  M  are  equivalent  in  the  sense  that  every  con¬ 
straint  equation  of  either  problem  is  a  linear  combination  of  constraint 
equations  of  the  other  and  that  for  any  vector  X  which  satisfies  the 
constraint  equations  AX  ~  B  or,  equivalently,  A*X  -  B*  we  have  also 
d  +  CTX  =  d*  +  C*'ilX. 

Note:  the  last  statement  is  formulated  to  include  vectors 
X  which  may  be  infeasible  in  the  sense  of  having  some  negative  com¬ 
ponents  . 

Proof:  Using  the  theory  of  partitioned  matrices  we  observe  that  a  prod¬ 
uct  of  matrices  of  the  form  ( 1 .11 )  each  having  h  =  1  and  k  -  1  has  the 
form  (1.13).  Moreover,  also  has  the  form  (1.13).  The  conclusions 
concerning  equivalence  then  follow  from  (1.10).  In  particular  we  observe 
that 


||.d*c*T||  •  ! | -del | 1  •  P | | BA | |  ( 


ie.,the  cost  rows  differ  by  a  linear  combination  of  constraint  rows.  Novr 


d*  +  C*TX 


-  | | -d*C-Tj |  ||J1||  -  | | -dCT| |  +  B  ||  BA| | | |  X1!!  - 

d  +  CTX  +  RO  -  d  +  CTX. 

For  later  use  we  note  that  if 

mm  m 

1  R* 

Q’1* 

o  p* 


then 

(1.15)  P*  -  P"1,  R*  -  -HP'1. 

'.Tie  conclusion  of  theorem  IB  states  that  a  sequence  of  pivota 
leads  from  any  linear  programing  problem  M  to  a  new  one  M*  which  is 
equivalent  to  it  in  a  very  strong  sense  of  equivalence. 


2.  Canonical  fora.  Convergence 


(  6  ) 


We  introduce  a  notation  for  submatrices  which  will  be  useful 
in  what  follows. 


Let  A  be  a  p-by-n  matrix,  let  C  be  an  n-by-1  column 
vector,  and  let  R  =  (r^. ...,rq)  be  any  sequence  of  integers  with 
1  i  rj  i  n  •  Then  we  denote  by  Ap  the  p-by-q  matrix  whose  columns 


in  order  are  Ar^,...,Ar  ,  and  we  denote  by 

vector  whose  corn  nents  ?n  order  are  cr^,.. 


Cp  the  q-by- 

crq’  i,e,» 


column 


(2.1) 


For  example,  let 

10-2110 
A  -  2  110-20 

-3  0  2  0  1  1 


d  =  14  , 

and  let  K  =  (2,  /,  3>  l);then 


and  for  S  =  (4,  2,  6)  , 

1  0 
AS  =  0  1 

0  0 


0  0 

0  ,  Cg  =  0 

1  0 


Let 


(  T  ) 


(2.2)  S  =  (s-^,...,Sp) 

be  an  ordered  sequence  of  p  different  integers  selected  from  the  set 
{l,  2, ...,n}  .  The  problem  (l.l)  is  said  to  be  in  canonical  form  with 
respect  to  the  baaic  sequence  S  if 


(Cl) 

AS 

=  JP 

,  canonical  coefficients, 

(C2) 

cs 

=  op 

,  canonical  costs. 

(C3) 

B 

£  0 

,  canonical  constants. 

Note 

that 

Cl 

states  that  the  system  of  equations  has  been  solved  for 

Xg^, . ..jX^  (the  basic  variables^  in  terms  of  the  remaining  (non-basic) 
variables;*’  C2  states  that  the  equations  have  been  used  to  eliminate  the 
basic  variables  from  tne  cost  function;  and  ,C3  states  that  in  the 
solved  form  cf  the  equations  the  constant  terms  are  non- negative. 


Associated  with  a  problem  in  canonical  form, there  is  a  feasible 
vector  Xs  and  corresponding  cost  zs  given  by 


(2.3) 


xSi  =  b^  (i  =  p) 

x|  ^  0  gts 


This  solution  is  referred  to  as  the  basic  solution  associated 
with  the  basic  sequence  S  . 


If  we  let  G  denote  the  sequence  remaining  when  the  elements 
of  S  are  deleted  from  (l, ...,s)  then  (2.3)  can  be  written  as 

(2.3')  xf  =  B,  Xq  -  0,  zS  =  d  . 

The  example  (1.5)  is  canonical  with  respect  to  the  basic  sequence 
S  =  (J+,5)-  Here  G  =  (l,  2,  3)  and  the  associated  basic  solution  and 
cost  are 


(  8  ) 


A  vector  F  =  | |fq  f2  • • •  fn| I  is  said  to  be  lexicographically 
positive,  written  0  ,  if  the  first  non -zero  component  of  F  is 
positive,  i.e.,  if  for  some  i,  1  g  i  g  n  ,  we  have  f ±  =  ...  =  f^  =  0  , 
f i  >  0  .  For  example,  j  j 2  -1  3 j j  ,  |  f  0  0  l||  ,  and  i  1 0  2  -3  4 1  j 

are  all  lexicographically  positive  whereas  j  j  0  -1  7||  ,  ||0  0  oil  , 

•  and  1 1 -1  7  8j  |  are  not.  If  F  -  0  we  say  that  F  is  lexico¬ 

graphically  greater  than  G  ,  written  F  ^  G  . 

If, for  a  problem  in  canonical  form, the  strict  inequality  B  >  0 
holds, then  we  have  automatically  that:  — every  row  of  j |b  a| |  is 

lexicographically  positive.  If  some  components  of  B  are  zero  but  the 
first  non -zero  is  each  such  row  is  positive  then  Ck  holds-  If, 

in  particular,  S  =  (1,  2,  .  ..,p),  then  C k  is  a  consequence  of  C3 
and  Cl  .  More  general] y,  if  Cl  and  J3  hold,  a  permutation  of  columns 
l,...,n  will  yield  C4  . 

Theorem  2A.  Consider  a  linear  programming  problem  in  canonical  form. 

Then  one  of  the  following  alternatives  holds: 

(i)  C  >  0  and  the  associated  basic  solution  is  optimal. 

(ii)  There  exists  k  such  that  ck  <  0  and  Ak  <  0  and  the  cost 

function  has  no  lower  bound. 

(iii)  There  exists  k  such  that  ck  <  0  and  i  such  that  >  0  , 

and  if  C4  holds  there  is  a  position  in  column  k  at  which  a 

pivot  transformation  results  in  an  equivalent  problem  in  canonical 
form,  satisfying  &  ,  and,  for  which  the  zero-th  row  | | -d*  c* | | 
is  lexicographically  greater  than  that  | J -a  C j  j  of  the  initial 
problem . 

Proof:  To  establish  (i)  we  note  that,  for  any  feasible  X  we  have 

(2.*0  Z  =  d  +  CTX  =  d  +  C|XS  +  cJXq  =  d  +  c£xG  *  d  -  ZS 

The  inequality  follows  from  the  fact  that  Cq  i  0  and  Xq  £  0  ;  hence 
is  optimal  and  Zs  is  the  minimal  cost. 

The  matrix  M  in  (2.5)  illustrates  case  (i). 


-20 

0 

0 

k 

9 

1 

0 

3 

-2 

7 

2 

0 

1 

3 

-3 

-2 

(2.5) 


To  establish  (ii)  consider  the  equations 
(2.6)  bi  =  xSi  +  aik  xk  i  =  1>--->P 

2  «=  d  +  Ck  Xk 

obtained  from  (l.l)  by  setting  all  of  the  ncn-basic  variables  equal  to 
zero  except  for  xk  .  As  xk  takes  on  larger  and  larger  positive  values  f 
each  xSi  either  grows  larger  also  (if  aik  <  0  )  or  stays  constant 
(if  a^  »  0  )  and  z  either  is  initially  or  ultimately  becomes  negative 
and  takes  on  larger  and  larger  negative  values.  Thus  z  -*  -  00  as 
xk  -»  os  and  always  with  feasible  vectors  X  .  The  situation  is  pictured 
ir.  Figure  2.1. 


The  matrix  M  in  (2.7)  illustrates  case  (ii)  with  k  =  4  . 

(2.7) 

M  = 

Case  (iii)  differs  from  case  (ii)  since  for  a^  >  0  ,  ejuations 
(2.6)  place  an  upper  bound  of  bjAik  on  xk  •  See  Figure  2.2. 

For  feasibility,  we  must  have 


-20 

0 

0 

-k 

p 

9 

1 

0 

3 

-2 

7 

2 

0 

1 

3 

-3 

-2 

(2.8)  xk  %  min  {bi/a^  |  aik  >  0}  =  bh/ahk  . 

If  now  we  pivot  at  (h,k),  the  new  basic  solution  is  given  by  (2.6)  with 
xk  =  hh/ahk,  and  t>h  >  0  »  the  cosb  z  is  reduced  by  addition  of 
the  negative  amount  bk  ck/akk  ;  i.e.  f 

(2*9)  z‘  =  z  -  *>h  ck/ahk  . 


Tin  matrix  in  (2.10)  illustrates  case  (iii)  with  k  *  3> 

h  ■  2  * 


(2.10) 

-20 

0 

0 

-3 

k 

5 

" 

M  « 

9 

1 

0 

3 

-2 

2 

0 

1 

3 

3 

-2 

Vaik 

9/3 

2/3 


and  after  pivoting  on  position  (2,3)  we  obtain 


*2.11) 

M  - 


GO 

H 

I 

0  10  7  3 

7 

2/3 

1-1  0-5  9 
01/31  1  -2/3 

( 11 ) 


which  is  in  case  (i)  and  therefore  the  associated  basic  solution 


(2.12) 


X  =  2/3  ,  z  =  18 

0 
0 


is  optimal. 


In  this  example  we  had  B  >  0  ,  ana  hence  reduction  in  the 
cost  z  (from  20  to  18).  It  sometimes  happens  that  =  0  and  then 
(cf.  (2.9))  there  is  no  improvement  in  z  ;  this  is  the  phenomena 
called  degeneracy .  To  handle  degeneracy  we  look  not  jurt  at  -d*  but 
at  the  entire  row  J | - d  c  j [  »  For  it  we  have 

(2.13)  ||-d*  C*T |!  =  ||-d  CT||  -  ||bh  ahi  ...  ahj| 

and  now  because  of  our  assumption  together  with  the  fact  that 
-ck/ahk  is  positive,  we  conclude  from  (2.13)  that  the  transformed  cost 
row  is  lexicographically  greater  than  the  initial  one;  i.e-,we  have 
strict  lexicographic  improvement  in  the  cost  row. 

However,  in  case  there  are  ties  in  (2.8),  i.e^if  h  is  not 
uniquely  defined  by  (2.8)  then  we  must  choose  h  so  as  to  preserve 
condition  C4  .  This  is  achieved  by  the  following  requirement: 

(2*l4)  «^k  &hl  ahn**  lexmin  ^  Hbi  ail  •**  ainlt|aik  >  °}  * 

k 

This  choice  of  h  is  unique  since  no  two  rows  of  M  can  be  equal. 

To  see  that  M  now  satisfies  C1*  we  first  observe  tnat  row 


h  of  M  is 


(2.15) 


■J—  I  l*h  ahl  •**  ahnl 


and  is  lexicographically  positive  because  the  h-th  row  of  M  was. 
Next,  if  i  »  h  we  have  for  the  i-th  row  of  M 


(2.16)  ||bi  a^  ...  a£n|  |  *  ||bi  a^  ...  a^l 


I Ibh  ahl  ahnl I 


aik^i“  N bi  ail  *•*  ain 1 1  *  Mbh  ahl  •••  a  hnlj). 


(  12  ) 


Now,  if  £  0  the  first  line  of  (2 .16 )  shows  that  row  (i)  is  either 
unchanged  or  is  lexicographically  increased;  and  if  a^  >  0  our  choice 
of  h  via  (2,lU)  quarantees  that  the  difference  in  the  second  line  of 
(2.16)  is  lexicographically  negative.  Thus  C4  holds  for  the  transformed 
matrix  M*  .  This  completes  the  proof  of  Theorem  2A. 

-ft 

Lemma  2b.  Let  M  and  M  define  equivalent  linear  programming  problems 
(in  the  sense  of  TheoremlB)  in  canonical  form  with  respect  to  the  same 
basic  sequence  S  .  Then  M  =  M*  . 

Proof:  Suppose  that  (1.10)  holds  for  Q  given  by  (1.13).  Then  using 
subscripts  to  denote  columns  of  a  matrix  we  have 

* 

=  QMj  ;  j  =  o, •. .,n  . 

ft 

In  particular,  if  jeS  ,  say  j  =  sh  we  have  MSh  -  MSh  =  U^+1  the 
(h+l)st  unit  vector  and  so 

uh+l  =  QUh+l  =  %  >  h  =  1, ...,p  . 

But  Qq  =  ;  hence  Q  =  Ip+1  and  M  =  M*  . 

Theorem  2C.  (first  convergence  Theorem)  Consider  a  linear  programming 
problem  in  canonical  form.  Then  after  a  finite  number  of  pivot  operations 
one  of  the  terminal  states  (i)  or  (ii)  of  Theorem  2a  is  attained. 

First  permute  columns,  if  necessary, so  that  (C4)  holds.  Then 

Proof:  Consider  a  sequence  M  =  1^,,  M]^,  M~,  . of  canonical  tableau 
matrices  all  in  case  (iii)  and  satisfying  (CU)  and  where  for  each 
i(i  *  1,  ...,t)  Mi  is  obtained  from  by  pivoting  on  a  position 

(hi,  ki)  selected  so  that  is  a  negative  entry  in  the  cost 

row  of  Mi_i  and  hi  is  determined  using  (2.lU).  Then,  since  by  the 
conclusion  for  case  (iii)  of  Theorem  2a,  there  is  lexicographic  improve¬ 
ment  in  the  zero-th  row  at  each  stage,  no  two  matrices  in  the  sequence  can 
be  equal.  It  follows  from  Lemma  2B  that  no  two  of  the  corresponding 
basic  sequences  can  be  equal.  Hence,  t.+l  cannot  exceed  the  number  of 
possible  basic  sequences.  We  conclude  t  is  less  than  the  number 
P(n,p)  of  permutations  of  n  objects  taken  p  at  a  time  and  therefore 
t  is  bounded.  On  the  other  hand, the  sequence  can  be  extended  unless  one 
of  the  terminal  states  (i)  or  (ii)  has  been  reached.  We  conclude  that 
a  terminal  state  can  be  reached  after  less  then  P(n,p)  pivots. 


(  13  ) 


Fortunately,  in  practice  the  simplex  algorithm  has  been  found 
to  terminate  after  relatively  few  pivots, although  there  is  a  yet  not  a 
mathematically  established  bound  which  comes  close  to  explaining  computa¬ 
tional  results. 


( 1* ) 


3.  Reduction  to  Canonical  Form 


Having  obtained  a  convergence  theorem  for  problems  in  canonical 
foxm,  we  now  turn  to  the  matter  of  reduction  to  canonical  form.  We  organize 
the  question  of  reduction  under  sequence  convergence  theorems,  each  cover¬ 
ing  a  more  general  case  than  the  former,  until  we  reach  one  which  applies 
to  the  most  general  form  of  linear  programming. 


Theorem  3A.  (Second  convergence  theorem . )  Let  M  represent  a  problem 
in  standard  form  for  which  Cl  holds.  Then  there  exists  a  finite  sequence 
of  pivot  operations  which  will  terminate  in  an  equivalent  problem  M* 
which  is  in  state  (i)  or  (ii)  of  Theorem  2b  or  which 

(iv)  has  an  infeasible  constraint  of  the  form  "negative 

scalar  equals  sum  of  products  of  non-negative  scalars", 

i.e.,  M*  has  a  row  of  the  form 

(3-1)  IK  an  •••  ainl  I 

where  bj  <  0  and  a^j  £  0  j  ■  1, ...,n  . 

We  sometimes  represent  states  (i),  (ii),  (iv)  symbolically  by 


(i) 


© 

© 

(ii) 


- 

© 

© 
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(iv) 


- 

©  _ 

Proof:  Conceptually,  we  may  achieve  C2  by  using  the  equations  to 
eliminate  the  basic  variables  from  the  cost  function.  Analytically,  we 
proceed  as  follows.  Let 

( 3  »2 )  »  1 1  i-i  . . .  zn  1 1  *  CST  A,  Zj  *  CST  B  ; 

then  successive  pivots  on  the  p  positions,  £l,  s^),  (2,  sg),  ..., 

(p,  Sp)  will  yield  a  new  cost  row  ||-d*  C*A| |  where 

(3-3)  C*  -  C  -  Z  ,  -d#  -  -  d  -  zc  , 

and  will  not  change  any  of  the  last  p  rows  of  N  .  Since  Ag..  is 

the  J-th  unit  vector  in  p-space,  z8  ■  c8j  so  that  Zsj  ■  0  (j  «  1, ...,p); 

hence,  C2  is  now  satisfied. 

Tables  3*1  and  3*2  give,  respectively,  a  schematic  representa¬ 
tion  and  a  numerical  example  of  this  process. 

Next,  if  C3  is  not  satisfied,  we  consider  an  auxiliary  sub¬ 
problem. 


(3-M 


-d' 

C,T 

B* 

A’ 

whose  zero-th  row  is  any  row  of  N  for  which  the  constant  term  is 
negative,  and  where  j  j B*  A'il  consists  of  all  rows  of  K  for  which 
the  constant  term  is  positive  or  zero .  Now  M'  is  in  canonical  form, 
and  we  apply  the  first  convergence  theorem  to  M'  to  obtain  a  sequence 
of  pivot  positions  leading  to  either  state  (i)  or  state  (ii).  We  actually 
pivot  in  all  of  M  ,  so  as  to  preserve  Cl  or  C2  for  it.  Let 
and  represent  the  resulting  problem  and  auxiliary  subproblem. 

f 

Jk 
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(  17  ) 


If  state  (i)  obtains  for  M*  and  -d*^  <  0  ,  then  M*  is 

in  state  (iv)  and,  therefore,  the  problem  is  infeasible.  If  state  (i) 

obtains  for  and  -d#-L  >  0  ,  we  form  a  new  subproblem,  M*H  , 

which  is  in  canonical  form  and  has  at  least  one  more  row  than  M'  had. 
[Actually,  if  -d**-  ^  0  ai'ter  any  pivot,  we  need  not  continue  to  a 
terminal  state  but  can  immediately  pass  to  a  "larger"  M*H  .] 

If  state  (ii)  obtains  with,  say,  <  0  and  a£^  <  0  , 

we  choose  the  cost  row  of  the  subproblem  as  pivot  row  and  col’san  k  as 

*1 

pivot  column.  Then  c^  is  the  pivot  element.  We  claim  that,  after 
pivoting,  the  resulting  M  will  be  feasible  every  row  of  M*^  (including 
its  cost  row)  and,  hence,  we  may  form  a  "larger"  as  the  continuing 

auxiliary  subproblem. 


To  verify  this  claim,  we  observe,  first,  that  -d***-  =  (-d**-)/ 
c^  >  0  ,  since  by  hypothesis  both  numerator  and  denominator  are  negative. 
Finally,  if  i  is  any  row  of  ,  we  have 


*  *  ♦l\l  »1  * 

bi  *  aik  (‘a  '/ck  ?  bi 


9 


since  by  hypothesis  the  first  term  in  the  fraction  is  nonpositive  (as 
an  element  of  A^1  )  and  tne  other  two  are  negative. 


Thus,  after  a  finite  number  of  auxiliary  subprobiems  of  increasing 
size,  we  must  *»Uher  reach  state  (iv)  or  obtain  an  M*  with  B*  >  0  and 
hence,  by  rheoi-m  -r’,  eventually  reach  state  (i)  or  state  (ii)  for  the 
full  problem.  This  coopiti  s  the  proof  of  Theorem  3A. 

Remark  If,  ^crcndipitously,  rows  of  M  not  ir.  become 

feasible  before  the  cost  row  of  does,  one  can  immediately  pass  to 

a  larger  subproblem.  The  desirability  of  including  the  search  for  this 
phenomenon  in  a  computer  program  is  still  an  open  question. 


Table  3*3  illustrates  pivot  defined  by  an  auxiliary  subproblem 
Note  that,  in  this  ex®ple,  one  pivot  achieves  one  more  feasible  row. 

An  additional  pivot  in  position  (3,1)  will  demonstrate  infeasibility  of 
the  problem. 

Theorem  38.  (Third  convergence  theorem)  .Let  M  represent 
a  problem  in  standard  form.  Then  there  exists  a  finite  sequence  of 
pivots  which  win  terminate  m  an  equivalent  problem  M  ,  wftjc^  gjLLUfcL 

(v)  has  a  constraint  of  the  fora  "nonzero  scalar 
equals  zero". 


ro  co 
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TABLE  3.3 

EXAMPLE  ILLUSTRATING  AUXILIARY  PROBLEM 


B  ^2  A3  A4  a5  A7 


(a)  Initial  tableau 


0 

0 

2 

0 

-3 

l 

1 

0 

0 

3 

1 

2 

0 

3 

1 

0 

-1 

0 

2 

0 

2 

(b)  Auxiliary  tableau 


AFTER  PIVOT  AT  (2,  5) 
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i ,e. ,  M  has  a  row  of  the  form 


(3*5) 

where  bi  ^  0  and  = 

of  any  zero  rows. 


bj_  ail  *  * •  ainl I 


=  a.n  =  0  ,  or  satisfies  Cl  after  deletion 


Proof:  Suppose  that  S  =  (s^,  . Sq)  is  a  sequence  of  q  distinct 
integers,  where  0  i  q  <  p  such  ^hat 


(3-6) 


- 


Then  let  h  ■=  q  +  1  and  consider  the  h-th  row  of  M  .  If  it  satisfies 
(v),  the  problem  is  infeasible  and  we  stop.  If  the  entire  row  is  zero, 
we  delete  it  and  proceed  with  the  resulting  tableau.  Otherwise,  there 
exists  k  such  that  ±  Q  .  Because  of  (3-6),  k^S  .  We  now  pivot 

on  position  (h,k)  to  obtain  a  new  tableau  with  one  more  canonical 
column.  We  let  s(_+^  =  k.  S’  =  (sp  . .  .,Sq+^)  ,  and  (3*6)  holds  for  the 
transformed  tableau  relative  to  3’  .  Since  each  stage  either  stops 
with  (v)  or  decreases  p-q  ,  the  process  cannot  continue  indefinitely, 
and  the  theorem  is  established. 

The  first  three  reduction  theorems  show  that,  for  any  problem 
in  standard  form,  there  exists  a  finite  sequence  of  pivots,  which  results 
in  an  equivalent  problem  M*  in  one  of  the  terminal  forms  (i),  (ii), 

(iv)  or  (v).  Wc  now  define  the  general  linear  program  and  show  how  tc 
reduce  it  to  standard  form;  thus,  we  will  complete  our  proof  of  convergence 
of  the  simplex  algorithm. 


The  most  general  form  of  linear  programming  problem  has  both 
nonnegative  and  free  variables  and  both  inequalities  and  equations. 
Suppose  that  there  are  n^  nonnegative  variables,  ng  free  variables, 
p^  inequalities,  and  P2  equalities.  The  problem  takes  the  form 


(3«7) 


Minimize  z  =  d  +  X^  +  Xg 


subject  to  the  constraints 
(3-8) 

(3-9) 

(3.10) 

(3.11) 


All  X1  +  A12  x2  ^  B1 
Ag^  X^  Agg  Xg  —  Bg 


xx  £  0 


Xg  free 
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We  introduce  a  vector  X3  with  p^_  components,  set  C3  =  0  , 
replace  (3*7)  and  (3*8)  by  ’ 

(3.7')  Minimize  z  =  d  +  C-J  Xx  +  C2T  X2  +  C3T  X3  , 

(3.8' )  An  Xi  +  A12  X2  -  I?1  X3  =  Bx  , 

and  add  the  constraint 

(3*12)  x3  -  0  * 

We  call  X3  a  slack  vector. 

The  resulting  problem  would  be  in  standard  form  were  it  not  for 
the  free  variables.  A  little  reflection  leads  to  the  observation  that 
basic  free  variables  present  no  difficulty  but  that  nonbasic  (i.e., 
independent)  free  variables  upset  the  logic  of  the  simplex  method.  This 
suggests  that  we  look  for  pivot  operations  which  will  bring  as  many  free 
variables  as  possible  ir-to  the  basis.  We  will  then  be  able  to  lay  the 
corresponding  equations  aside,  temporarily,  and  deal  with  a  smaller  prob¬ 
lem  in  standard  form. 


We  proceed  inductively  and  assume  for  some  q  with  0  %  q  £  n2 
that  there  are  q  free  variables  in  the  basis  and  that  by  permuting  rows 
(if  necessary)  these  free  vaiiables  axe  in  rows  1,...,  q  and  that  the 
basic  free  variable  in  row  i  is  in  column  s^(i  =  l,...,q)  .  If  q  <  n2 
there  exists  another  free  column  k  and  for  it  there  are  three  cases  to 
consider  (set  p  =  p-l  +  p2) 


aq+l,k  *  ***  *  ®p,k  ~  ck  =  0 

aq+l,k  *  *•*  =  ®pk  =  0  »  ck  +  0 
(c)  for  some  h  with  q  <  h  £  p  ,  a  ^  ^  0  . 


If  case  (a)  holds  for  every  nonbasic  free  column,  or  if  q  *  n2  , 
we  consider  the  subproblem  which  remains  when  we  delete  all  of  the  free 
columns  and  the  first  q  rows.  This  subproblem  is  in  standard  form  and 
therefore  can  be  handled  by  the  methods  already  developed.  If  the  sub- 
problem  has  a  basic  optimal  feasible  solution  (X3/,  X3),  we  obtain  a 
basic  optimal  feasible  solution  (X^,  X2°,  X3°)  for  the  main  problem 
by  setting  each  nonbasic  free  variable  equal  to  zero  and  by  setting 
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(3.13)  x°t  =  bt  -  (au  x,0  +  ...  +  alni  x^) 

,  o 

"  'ai,n1+n2+lxn1+n2+l 

Moreover,  the  minimum  value  of  z  is  the  same 
the  subproblem. 

If  case  (b)  holds,  then  either  there  is  no  lower  bound  for  the 
objective  function  or  the  problem  is  not  feasible.  For,  suppose  Y®  is 
any  feasible  solution  with  corresponding  objective  value  z®  .  Then 
define  a  vector  Y  as  follows: 

(3-lM  Ysi  =  “aik  ^  = 

yk  =  1 

=  0  for  all  other  i  . 

Then  Y  makes  the  left-hand  sides  of  (3*7')  and  (3*8)  zero,  so  that 
the  vector 

X  =  X°  +  tY 

is  (i)  feasible  for  all  real  numbers  t  ,  and  (ii)  yields  z  =  z°  +  tck  . 

It  is  clear  that  the  constraint  equations  (3*7')  and  (3*8)  are  satisfied 
by  X  .  Moreover,  for  all  t,  X1  *  X-^0  £  0  and  X^  *  X^°  >  0  ,  so  that 
X  satisfies  all  of  the  feasibility  requirements.  Next,  z  *  Cq  +  CTX 
=  (cQ  +  CTX°)  +  CtY  =  zQ  +  tck  (since  cSi  =  0,  i  =  l,...,q).  Now,  by 
giving  t  the  sign  opposite  that  of  ck  and  by  Baking  the  absolute 
value  of  t  sufficiently  large,  we  can  make  z  take  negative  values 
with  arbitrarily  large  absolute  value;  i.e.,  the  objective  function  has 
no  lower  bound.  Thus,  if  case  (b)  occurs,  we  stop  knowing  that  the  problem 
has  no  solution. 

Finally,  if  case  (c)  holds, we  (i)  pivot  at  position  (h,k),  (ii) 
permute  row  h  and  row  q+1  ,  (iii)  set  sQ+i  «  k  and  thereby  have  com¬ 
pleted  an  Inductive  step.  We  then  iterate  the  process  until  eventually 
we  reach  case  (a),  case  (b),  or  q  «  ng  . 

The  three  cases  are  illustrated,  respectively,  in  Tables  3*1* 

3.5,  and  3*6.  The  second  tableau  in  Table  3*6  comes  under  case  (a).  In 
all  of  the  examples  x^,  xg  are  nonnegative  variables,  X3,  x^  are  free 


a  xu  -  \ 

i,nl+n2+n3  n^+ng+n^ ' 

i  1,  , « . ,  q  » 

for  the  main  problem  and 
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TABLE  3.4 

CASE  (a)  TRANSITION  TO  SUBPROBLEM 


B  Aj  A2  Aj  Aj|  A5 


Pi  ■  1/  P2  *  2,  nj  »  2,  n2  »  2,  q  =  1,  S!  =  3,  k  =  4 


TABLE  3.5 

CASE  (b)  UNBOUNDED  OBJECTIVE  JUNCTION 


B  A^  A2  A^  Aj^  A^ 


z  =4  -  2t  -*  -»  as  t  -♦  +» 


(  2k  ) 


variables,  and  Xj-  is  a  slack  (nonnegative)  variable.  The  tableau 
corresponding  to  the  subproblem  in  the  first  example  is 


and  is  in  optimal  form  with  corresponding  solution 


k 

0 

2 


as  listed. 


The  subproblem  obtained  from  the  second  tableau  in  the  third  example  is 


B 


A^  Ag  Ac; 


E 

-1  9  -2 

F 

-1  3  -6 

After  pivoting  in  position  (1,2)  of  this  subproblem,  i.e.,  position  (3,2) 
of  the  corresponding  full  problem,  we  obtain 


-2 

2  0  16 

Mi. 

-1/3  1  -2 

which  is  in  case  (i),  so  that  the  original  problem  has  the  optimal  solution. 


z  =  2 


We  summarize  these  results  in  our  final  convergence  theorem. 

Theorem  3C.  (Fourth  convergence  theorem.)  Given  a  general  linear  program- 
jniggjgjg^e^  (3«7)  through  (3«ll),  there  exlBts  a  finite  sequence  of  pivot 
operations  which  either  leads  to  an  opMmal  solution,  demonstrates  in- 
feasiblllty,  or  demonstrates  unbounded  coat. 

Actually,  we  have  done  more,  because  we  have  given  an  algorithm 
which  selects  the  successive  pivot  positions  and  which  identifies  infeasibil 
ity  and  unboundedness  in  several  specific  forms. 


(  25  ) 


There  is  an  alternate  method  for  handling  free  variables, 
namely,  the  replacement  of  Xg  throughout 

(3-15)  X2  =  Xo  -  x£ 

where 

(3-16)  X2  £  0  ,  Xg  >  0  ; 

we  thus  have  an  equivalent  problem  with  only  nonnegative  variables. 

This  alternate  method  is,  for  theoretical  purposes,  quite  satisfactory; 
however,  it  fails  to  take  any  advantage  of  the  flexibility  which  free 
variables  afford  via  the  reduction  to  a  smaller  problem  after  the  free 
variables  are  pivoted  into  the  basis. 


ERRATA 


7.6  Replace  (1 ,  . ..,  s)  by  (1,  ...  ,n) 

8.11  Replace  is  by  in 

10  Top  figure.  Replace  by  by  b^ 

11.3  Replace  i  s  h  by  i  i  h 

12.4  Replace  negative  by  positive 

19.9  Delete  comma 

14.4  Should  read  .  .  .under  a  sequence  of  convergence 

26  (4.2)  Replace  d*  by  «d* 

(4.4)  Replace  allc  by  c^ 

Replace  agfc  by  a^ 

Replace  ck  by  apk 
31 .7  Replace  M  by 

36.i6  Replace  by 

37.12  Replace  Cgby  eg  twice 

41 .5  Should  read  .  . .  the  protein  illustration  b'^ 
represents  the. .  . 

41.11  Replace  second  r  by  t 

41.20  Replace  f  by  t 


INTRODUCTION  TO  LINEAR  PROGRAMMING 


R.  M.  Thrall 


IHntaDUCTION  TO  LINEAR  PROGRAMMING 


R.  M.  Thrall 


Chapter  Page 

1.  Introduction  . 1 

2.  Canonical  form,  Convergence  . . 6 

3.  Reduction  to  Canonical  Fora . 

A  Computational  Check;  lhe  Revised  Simplex  Method  ....  26 

5.  Duality .  30 

6.  Interpretation . 39 


(26) 


4.  A  Computational  Check;  The  Revised  Simplex  Method 
Let 


(4.1) 


be  the  initial  tableau  matrix  for  a  problem  in  standard  form,  and  let 


(4.2) 


M*  * 


d*  C* 
B*  A* 


T 


be  a  second  tableau  matrix  for  the  same  problem  which  we  assume  to  be  in 
canonical  form  relative  to  some  basic  sequence  S*  ■  (s^*,  . ..,  s  *). 

For  the  case  of  a  full  tableau  matrix  M  and  any  basic  sequence 
S  »  (s^,  ...,  s  )  we  modify  cur  previous  notation  for  submatrices  slightly 
and  thus  write  P 


(4.3) 


MS  “ 


1  C 


0  A 


to  designate  the  square  matrix  of  degree  p  ♦  1  consisting  of  the  first  unit 
vector  of  Vp  +  x  •  followed  by  columns  s^,  . ..,  s  of  M  (we  consider 
i I  ’g  1 1  ^  ^  column  0  of  M) . 

The  process  of  pivoting  on  position  (h,k)  of  M  can  be  regarded  as 
preraui  tipi  lection  of  H  by  the  n»trix 


(4.4) 


1  0 
0  1 


0  0 


-u/\*  • 

-A 


i/a 

hk 


-c  /a 
k  hx 


q  *  w.  ^‘vas!- 


(27) 


Here  F  differs  from  the  identity  matrix  I  +  only  in  column  h  +  1 

which  is  indicated  in  (^.4),  or  in  terms  of  $,  f^  =  "mi’K/mhk  for  a-*--4 
i  ^  h  and  f  =  l/ m^  .  It  follows  that  a  sequence  of  pivot  operations 
can  be  effected  by  a  single  matrix  multiplication,  and  the  reverse  se¬ 
quence  can  be  acheived  by  multiplying  by  the  inverse  of  this  same  matrix. 
Suppose  then  that 


(*.5) 


M*  =  QM,  M  =  Q  Mf 


Since  we  never  pivot  in  the  initial  row  or  column>.each  pivot  matrix  F 
has  Initial  column  jl  fj||  and  hence  so  does  their  product  Q  as  well  as  its 


inverse  Q‘4. 


(*.6) 


0  M*  ...  M*  ||=I 
.  s  *  s  *  II  p  +  1 

i  1  P 


It  follows  from  (4.5)  that  for  each  j,  M  =  ;  also 


hence  we  have 


0  » 


(4.7) 


M  =  Q.'-1-  M*  =  Q-1  ,  or 
S*  S* 


(4.6) 


M  =  M*  . 


This  equation  provides  a  numerical  check  of  the  accuracy  of  M* 
under  the  hypothesis  that  M*,  but  not  necessarily  M,  is  in  canonical  form. 
We  call  it  a  reverse  check  since  it  goes  from  M*  to  M  by  premultiplication 
with  a  "submatrix"  of  M. 


If  M  is  in  canonical  form  with  respect  to  a  basic  sequence 


S  =  (s  ,  ...,  s  ),  then 
1  P 

(4.9) 


M*  =  M*  M  ,  i.e.,  Q  =  M  _. 


We  call  (4.9)  a  direct  check  since  it  multiplies  M  by  a  "submatrix"  of  M* 
to  obtain  M*.  If  a  check  (either  direct  or  reverse)  discloses  an  error,  one 
can  then  check  in  intermediate  stages  and  soon  locate  the  source  of  the 
error.  For  example,  if  M*  is  the  result  of  12  cycles,  first  check  at  Cycle 
t.  If  this  is  correct* check  at  Cycle  95  if  it  is  incorrect  check  at  Cycle  3, 
etc. 


(23) 

Wi-  observe  that  when  both  M  and  M*  are  in  canonical  f'jrm,  then 


(4.10) 


MS.  M  s  ■  rp  *  1  ; 


thus  we  may  regard  the  simplex  method  as  being  a  systematic  procedure  for 
calculating  inverses  of  the  "subma trices"  M^of  M  and  for  specifying, by 
selection  of  S* ,  which  subma trices  to  invert. 


In  applying  (4.5') ,  given  M  and  Q,  >  there  is  no  need  to  calculate 
all  of  M*.  Indeed,  if  we  first  calculate  the  initial  row  of  M*  we  can 
either  (i)  find  an  index  k  for  which  c^*  <  0  or  (ii)  conclude  the 
M*  is  in  optimal  form. 


In  case  (i)  we  calculate  column  k  of  M*  and  conclude  either 
(iii)  there  is  some  a^  >0  or  (iv)  the  problem  has  no  optimal  solution. 
In  case  (iii)  we  calculate  column  0  (the  B  column)  and  apply  the  pivot 
selection  rule  to  determine  the  next  pivot  position  (h,k)  Let  M’  be  the 
nfetrix  obtained  from  M*  after  pivoting  at  (h,k).  Then 


(4.11) 


M'  =  F*M* 


where  F*  is  given  by  (4.4)  (calculated  from  M*  rather  than  from  M). 
Then,  from  (4.5)  we  get 

(4.12)  M'  *  Q'M  where  Q*  -  F*Q. 

This  completes  a  full  cycle  of  the  algorithm.  Note  that  this  form  of  the 
simplex  algorithm  requires  (i)  keeping  a  permanent  record  of  the  initial 
tableau  matrix  M,  (ii)  keeping  a  temporary  record  of  the  transforming 
matrix  Q,  which  is  discarded  in  favor  of  its  successor  Q'  when  it  Is 
calculated,  and  (iii)  calculating  the  final  row,  the  initial  column,  and 
one  other  column  of  M*.  These  are  used  to  decide  the  state  of  M*  and 
to  determine  F*  in  case  M*  is  ndt  in  a  terminal  state  and  then  are  dis¬ 
carded.  One  might  also  wish  to  (iv)  keep  a  record  of  the  basic  sequence 
S' . 


[The  revised  simplex  method  is  not  recommended  for  hand  calculation. 
For  that  matter,  hand  calculation  in  any  form  is  clearly  hopelessly  inef¬ 
ficient  for  all  but  very  tiny  problems.  Thus  we  must  evaluate  the  revised 
simplex  method  with  machine  calculation  in  mind.) 

It  is  not  particularly  difficult  to  devise  a  computer  program 
which  will  put  into  effect  the  four  parts  (i)  -  (iv)  of  the  revised 

simplex  method.  This  method  has  several  important  advantages  for  digital 
calculation  ; 


(29) 


(a)  All  calculations  are  based  on  the  initial  data,  and 

the  accuracy  of  Q  can  be  tested  by  reference  to  (4.7 ),  i.e,,  Q 

is  the  inverse  of  Mg*} 

(b)  If  the  original  data  is  sparse  (i.e, .has  many  zero  co¬ 
efficients)  the  calculations  vill  be  simpler  and  quicker  at  every  cycle 
since  this  sparse  matrix  M  is  used  throughout.  In  the  ordinary  algorithm 
sparseness  is  rapidly  lost.  Moreover,  very  large  sparse  problems  can 

be  handled  with  a  computer  memory  vhich  could  not  touch  nonsparse  prob¬ 
lems  of  the  same  size; 

(c)  The  fact  that  not  all  of  M*  need  be  calculated  gives 
substantial  computing  economy, 

In  most  current  computer  programs  the  method,  as  described, is 
modified  in  several  ways, of  which  we  discuss  only  one.  Instead  of  storing 
Q,  we  store  only  each  factor  F' ,  and  for  F'  we  store  only  its  non -identity 
column  together  with  a  record  of  the  pivot  position.  The  algorithm  in  this 
form  is  called  the  "product  form  of  the  revised  simplex  method"  and  several 
computer  algorithms  for  this  method  are  in  current  use.  We  do  not  discuss 
the  details  here. 


(50) 


5.  Duality 
V 

(5.1) 
and 

(5.2) 


We  consider  a  pair  of  related  problems 

T 

DV  <  R,  V  -  0,  maximize  P  V 


DTU  =  P,  U  ^  0,  minimize  RT  U, 


where  D  is  a  p-by-n  matrix  and  we  assume  that  problem  (5.1)  has  an 
optimal  feasible  solution. 

The  introduction  of  slack  variables  Y  in  (5*l)  leads  to  a 
problem  in  standard  form 

(5*5)  AX  =  B,  X  =  0,  minimize  z  =  d  +  X 

where 

(5-M  A  -  |  |D  I|  | ,  X  -  ||  ?  ||,  C-  ||  '5  ||,  B-R,  d  -0 

and  Cl  and  C2  hold  for  the  basic  sequence  S  =  (m  +  1,  m  +  p). 

Thus  as  initial  tableau  we  have 


(5.5) 


let  the  final  (optimal)  tableau  be 


(5.6) 


Next,  we. let 

(5.7)  u0  M  » 
and  then  from  (^.9)  we  get 

(5.8)  -d*  «  U  0T  R,  C*T  -  -PT  +  U  T  D. 


(31) 


Now,  since  (5*6)  is  in  optimal  form, we  have 


(5.9) 

or,  equivalently, 

(5.10) 


C*  >0,  C*  £  0, 

uo  5  0-  »T"0  5  p- 


Hence,  UQ  is  a  feasible  vector  for  the  dual  problem. 


!*t  XQ  = 


be  the  optimal  vector  for  the  primal  problem.  Then 


(5-11)  d*  =  CTXq  =  -PTVQ  ; 

her-ce 

(5.12)  PTVo  =  RT  u0  =  -d*  • 

For  any  feasible  vectors  V  and  U  for  (5«l)  and  (5*2)  we  have 

(5.13)  RTu  =  utr^  utdv  >  PTV. 


From  (5*13),  (5.12)  and  the  feasibility  of  VQ.,  UQ  ,  it  follows 
that  both  VQ  and  Itq  are  optimal.  Since  Tj  Q  appears  as  part  of  the  initial 
row  of  M* ,  we  see  that  the  simplex  algorithm  solves  both  of  the  related 
problems  simultaneously  and  that  the  objective  values  are  equal. 


■Hie  relationship  between  the  problems  (5*1)  and  (5.2)  is  an  inter¬ 
esting  special  instance  of  duality  in  linear  programming.  Even  this  special 
instance  has  a  number  of  important  applications;  for  example,  the  minimax 
theorem  of  game  theory  is  a  consequence  of  (5.11).  However,  the  special 
form  of  (5.1)  leads  us  to  ask  how  to  define  duality  fcr  more  general  linear 
programming  problems.  Indeed,  we  might  ask  about  the  characteristics  of  a 
duality  relation  in  still  more  general  situations. 

Let  E  be  a  set  of  objects.  A  mapping  T  of  £  into  itself  is 
said  to  be  involutory  if  YaeM  >  T(T(a))  *  a.  If  ^  is  an  equivalence  relation 
on  E  ,  T  is  said  to  equivalence  preserving  if  a  ^  a'  implies  T(a)  -o  T(a'). 

We  say  that  T  is  meaningful  relative  to  some  relation  R  on  Z  if 
for  all  acE,  aP.T(a). 

For  example,  let  E  be  the  set  of  all  linear  programming  problems 
each  of  which  has  either  form  (5*1)  or  the  form 


(5.2') 


FU  b  Q»  U  £  0,  minimize 


(32) 


and  let  T  be  the  mapping  used  above  which  sends  the  problem  of  (5*1)  into 
(5.2)  and  conversely.  In  more  detail. if  we  let  Z^  denote  the  set  of  all 
problems  of  form  (5.1) >  let  Zg  denote  the  set  of  all  problems  of  form  (5.2'), 
and  let  Z  =  T^U  Z^  then  T  interchanges  the  two  subsets  Z^,Zg  an<*  w^en 
applied  twice  to  a  fly  problem  in  Z  gives  back  that  same  problem.  The  invol- 
utory  property  of  T  depends  on  the  fact  that  transposing  any  matrix  twice 
gives  back  that  same  matrix.  This  mapping  T  is  not  only  involutory  but 
also  is  meaningful  relative  to  the  relation 


(5.1*0 


optimal  value  a  =  optimal  value  T(a). 


More  generally,  a  mapping  T  is  meaningful  if  there  are  useful  theorems  con¬ 
necting  a  and  T(a);  the  duality  theorems  of  linear  programming  are  of  this 
nature. 


A  mapping  T  which  has  all  three  of  these  properties  i.e. 
which  is  involutory,  equivalence  preserving,  and  meaningful  qualifies  as 
the  very  "best"  type  of  duality.  We  obtain  such  a  duality  for  linear 
programming  under  a  rather  general  equivalence  relation. 


We  now  denote  by  Z  the  set  of  all  linear  programming  problems 
in  the  general  form  (3.7)  -  (3.11).  Given  any  such  problem  called  the 
primal  problem,  we  show  in  Table  (5.1)  how  to  define  a  new  problem  in  Z 

mal.  We  may  characterize 
n^  +  n2  +  l)  matrix 


the  primal  problem  by  the  (p^ 


(5.1*0 


G  = 


*2 

+  1) 

-by. 

-d 

c  T 
°1 

c  T 
U2 

B1 

A11 

A12 

B2 

A21 

A22 

together  with  the  ordered  set  of  integers  (p  ,  p  ,  n  ,  n  )  which  indicates 
the  nature  of  each  constraint  and  variable.  Then  the  dual  problem  is 
characterized  by  the  matrix  -  G^  together  with  the  sequence  (n^,  n^,  p^,  p^). 

Clearly,  the  correspondence  between  primal  and  dual  is  involutary,  since, 
in  particular,  -(-GT)T  ■  G.  Note  that  the  dual  is  also  a  minimization 
problem;  each  inequality  in  the  primal  system  corresponds  to  a  non-negative 
variable  in  the  dual;  each  equation  in  the  primal  system  corresponds  to  a 
free  variable  in  the  dual.  Corresponding  to  each  non-negative  variable  in 
the  primal  system  is  an  inequality  whose  coefficients  are  the  negatives  of 
the  coefficients  of  this  variable  and  whose  right  hand  side  is  the  negative 
of  the  coefficient  of  the  variable  in  the  primal  objective  function; 


(55) 


Primal 

Dual 

min 

T  T 

C1  X1  +  C2  X2  +  d  =  z 

T 

min  -B  ■ 

T 

■  B  U  - 

2  2 

11 

V 

A  X  +  A  X  >  3 
11  1  12  2  -  1 

U 

1 

l>  V 

0 

V 

A_  X  +  A  Xrt  =  B 
21  1  22  2  2 

U2 

free 

n  : 
1 

X  >  0 

1  = 

T 

-An  V 

T 

A21*  U2 

n  : 
2 

free 

1 

ET* 

EH 

CM 

1 

T 

A22  U2 

11 

1 

O 

ro 

Table  5.1. 

Dual  Systems,  General 

Case. 

each  free  variable  in  the  primal  corresponds  to  a  similarly  defined  equation 
in  the  dual  system.  The  dual  objective  function  has  as  coefficients  the 
negatives  of  the  right  hand  sides  of  the  primal  restraints.  (The  minus  sign 
attached  to  d  in  (5 .1*0  is  necessary  for  uniformity  in  reading  relations 
from  the  matrix;  the  initial  row  represents  the  equation  z-d  =  Cg'TXg). 

This  duality  includes  the  previous  one  as  a  special  case  if  we 
observe  that  (5*1)  is  equivalent  to 

(5.2”)  -D  V  £  -  R,  V  >  0,  minimize  -  PT  V. 

Two  special  cases  of  the  general  linear  programming  problems  are 
of  frequent  occurrence.  The  case  p  =  q  =  0  (cf.  5.1)  is  called  the 
symmetric  form  and  is  the  one  that  arises  naturally  in  many  applications. 

The  case  p.  =  qg  =  0  is  called  the  standard  form  (cf.  Section  1  above). 

We  recall  that  reduction  to  standard  form  is  a  first  step  in  preparing  the 
problem  for  application  of  the  simplex  algorithm.  The  dual  of  a  symmetric 
problem  is  again  symmetric;  however,  the  dual  of  a  problem  in  standard  form 
is  not  in  standard  form. 

We  saw  in  Section  3  (cf.  (3>7')>  (3*8'),  and  (3.12))  how  to  change 
from  a  problem  in  symmetric  form  to  one  in  standard  form  via  introduction 
of  slack  variables.  Thus, if  we  can  show  that  a  problem  in  general  form  is 
equivalent  to  one  in  symmetric  form, we  will  have  established  that  each  of 
the  three  forms  actually  includes  all  problems.  To  do  this  we  first  replace 
the  free  vector  Xg  by  the  difference  X£  -  X£  of  two  non-negative  vectors, 
and  then  replace  each  equation  by  two  oppositely  directed  inequalities. 

This  yields  the  symmetric  system 

X  £  0,  min  z  *  d  +  CT  X 


(5.15) 


AX  ^  F 


where 


*11 

*12  ‘*12 

*21 

CM 

l 

CM 

A21 

‘*22  *22 

B, 

c 

X 

1 

1 

1 

B  = 

B2 

,  C  = 

C2 

X  = 

-B2 

-C2 

*2 

It  is  easy  to  verify  that  the  dual  of  (5-15)  is  equivalent  to  the  original 
dual;  i.e.,our  duality  is  equivalence  preserving. 

We  might  ask  more  precisely  in  what  sense  the  general  primal  form 
is  equivalent  to  (5 .15)*  Clearly, any  feasible  solution  X  of  (5*15)  defines 
a  feasible  solution  X^  ,  Xg  =  Xg  -  X^j  of  the  original  problem  and  with 
the  same  z  value.  Conversely,  if  X^  ,  X2  is  a  feasible  solution  for  the 
original  problem  and  if  -t  is  smaller  than  any  component  of  then 
*2  "  *2  +  t  V  X2  =  *  EP2,  (where  for  any  h  ,  is  the  vector  with  h 
components  all  equal  to  ofie)  gives  a  feasible  solution  X  for  (5»15)  with 
the  same  z  value.  Hence  either  the  two  problems  have  the  same  optimal 
value  or  neither  has  an  optimal  solution.  Alternatively,  given  Xg  one  can 
find  unique  XI,  X£  with  X^T  Xj  »  0  ,  X'  >  0,  X"  >  0,  and  Xg  =  X'  -  X"  . 
It  can  be  shovm,  moreover,  that  vie  extreme  solution^  of  the  two  systems  ^ 
can  be  put  into  one-to-one  correspondence;  this  means  that  the  arguments 
which  Justify  the  simplex  method  can  be  extended  to  show  how  to  obtain  all 
extreme  solutions  of  the  original  problem  from  the  basic  solutions  of  the 
equivalent  problem  in  standard  form. 


V2 

We  give  an  illustrative  example 

Primal 

in  which 

P,  -  1>  P„  3  2, 
1  2 

Dual 

min  2x^ 

+  3xg  +  2xj  -3xj^  +  7x^  +  2  ®  z 

min  - 

3^  +  7ug  -  2u^  - 

+  3x2  ‘  7x3  *  kxk  *  x3  "  3 

u  >  0 

1 

3X1 

-  *2  -  3x5  +  5x^  +  2x5  -  -7 

u  free 

2 

4x, 

+  2x„  -6x_  -8x.  -4x^  »  2 

u  free 

1 

2  3  4  5 

3 

(5.16) 

x  >  0 

-2u 

-  3u  -  4u  >  -2 

1 

1 

2  3 

x  »  0 

-3u 

+  u  -  2u  >  -3 

2 

1 

2  3 

x  >  0 

3 

7ui 

+  3u2  +  6u5  >  -2 

x,  free 

4u 

-  5u  +  8u  *  3 

4 

1 

2  3 

x  free 

-u 

-  2u  +  4u  ®  -7 

5 

1 

2  3 

The  corresponding  primal  problem  in  symmetric  form  has  (cf.  (5.16)). 


A  - 


2 

3 

-7 

-4 

1 

4 

-1 

3 

-1 

-3 
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-5 

-2 

4 

2 

-6 

-8 

-4 

8 

4 

-3 

1 

3 

-5 

-2 

5 

2 

-4 

-2 

6 

8 

4 

-8 

-4 

We  have  shown  that  our  duality  is  involutory  and  equivalence  pre¬ 
serving  so  far  as  transitions  from  general  to  » yens trie  to  standards  forms 
are  concerned.  We  conclude  the  present  section  by  stating  and  proving  the 
general  duality  theorem  fer  linear  programming  thus  shoving  that  duality  is 
meaningful. 


We  recall  that  for  a  general  linear  programming  problem  there  are 
three  mutually  exclusive  possibilities: 


(56) 


(a)  Uiere  is  a  basic  optimal  feasible  vector, 

(b)  Hie  problem  is  feasible  but  objective  function  has  no  finite 
lower  bound  for  feasible  vectors, 

(c)  'There  is  no  feasible  vector. 


The  fundamental  duality  theorem  states 


(i)  If  either  primal  or  dual  has  an  optimal  solution  so  does 
the  other, 

(ii)  If  both  primal  and  dual  are  feasible  then  both  have  optimal 
solutions. 

(iii)  If  either  primal  or  dual  is  feasible  but  with  no  lower  bound 
for  the  objective  function  then  the  other  is  infeasible. 

(iv)  Both  primal  and  dual  may  be  infeasible. 

(v)  If  (i)  holds  then 


(5.18) 


2  +  2  '  a  0 

opt  opt 


(vi)  If  X  ,  X  ,  and  U  ,  U  are  any  feasible  solutions  to  the 


primal  and  dual,  respectively,  fev 


en 


(5.19)  z  +  z’  ^  0. 

We  first  establish  (5.19)*  Wb  have 

t  +  z'  -  (ciTx1  ♦  c2tx2  ♦  d  )  -  (b1tu1  ♦  b2tu2  ♦  d  ) 

>ctx+ctx-ut(a  X  ♦  A  X) 
•11  2  2  1  11  1  12  2 

V<*21  X1  *  *22  V 

T  T  T 

«  (C  -U  A  -  U  A  )  X 
1  1  11  2  21  1 

♦  (C  T  *  U  T  A  -UTA  )  X 

2  1  12  2  22  22 

>  0 


Since  the  first  term  is  a  sum  of  products  of  non-negative  factors  and  the 
second  is  0.  Note  that  we  used  all  of  the  feasibility  conditions  for  both 
primal  and  dual  in  establishing  this  chain  of  Inequalities. 

Next,  we  assume  that  the  primal  problem  has  an  optimal  solution. 


(3?) 


(5.20) 


*■* 


whence  it  follows  that  U  ,  U  is  a  feasible  vector  for  the  dual  with 
objective  value  z'  *  -£*  =  -  z  ^  and  hence 

z  +  z '  =  0 
opt 

This  together  with  (5.19)  shovs  that  u  ,  U,  is  optimal  for  the  dual  with 
z'  =  z'  so  that  (5-18)  holds.  F^nce  duality  is  involutory  this 
es  ®ilishes  (i)  a:.d  (v). 

Next,  we  observe  that  if  both  problem*  are  feasible  then  (5*19) 
shows  that  neither  can  fall  in  case  (b)  and  this  establishes  (ii).  Finally, 
(iii)  and  (iv)  together  are  merely  the  contrapositive  statement  of  (l) 
and  (ii).  We  give  examples  to  illustrate  possibilities  (iii)  and  (iv). 

Hie  slef  dual  problem 

(5.21)  min  z  *  -x^  subject  tc  the  constraints  Ox,  5=  1,  ^  0 

illustrates  (iv),  and  the  problem 

(5.22)  min  z  *  -x^  subject  to  the  constraints  >1,  x^  ^  0 
illustrates  (iii). 


(39) 


6  .  Interpretation  of  Final  Tableau 

Suppose  that  the  M  of  (U.l)  is  the  final  (optimal)  tableau 
matrix  for  a  linear  programming  problem.  Each  number  in  M  has  an 
interpretation  in  the  language  of  activity  analysis.  The  entire  first 
column,  of  course,  gives  the  optimal  vector  and  minimal  value  of  the 
objective  function.  Each  other  column  can  Ve  used  to  describe  how  to 
adjust  the  optimal  program  as  a  prescribed  amount  of  a  nonbasic  activity 
is  introduced. 


(6.1) 


The  optimal  solution  is,  of  course, 

C 

xs^  =  b^  (i  -  l,...,p) 
Xj  =0  j  {  S 


Here  S  =  (s]_,...,Sp)  is  the  optimal  basic  sequence  and  as  before  we 
denote  by  G  the  set  of  n-p  indices  not  in  S  .  Let  k  be  a  nonbasic 
index  and  suppose  that  we  wish  to  introduce  t  units  of  activity  k 
into  our  program.  Then  the  new  "best"  solution  is 


(6.2) 


x'Si  =  bi  -  aik  b 

x  ’  .  =  0  j  e  G,  j  k 

J 

X’k  =t 
z '  =  d  +  c^  t 


We  consider  two  cases:  (i)  k  is  a  slack  index  and  (ii)  k 
is  a  natural  index.  Case  (ii)  is  simpler  and  we  treat  it  first. 


Natural  activities,  e.g.  building  picture  frames,  eating  steak, 
are  irreversible  hence  we  restrict  t  to  nonnegative  values.  The  upper 
bound  foi  t  is  determined  by  the  fact  that  if  any  a^  is  positive 
then  t  cannot  exceed  b^/aj^  lest  x'E^  become  negative.  Thus,  for 
t  we  have 

(6.3)  0  ^  t  £  min  {bi/ai k| aik  >  0;  i  =  1, ...,p} 

If  no  a^y  is  positive  there  is  no  finite  upper  bound  for  t  ;  occurence 
of  this  in  a  practical  problem  is  an  indication  that  some  constraint  may 
have  been  overlooked  in  setting  up  the  model. 


(4o) 

A  positive  value  for  a  slack  variable  indicates  that  not  all 
of  some  resource  has  been  utilized  or  that  some  requirement  has  been 
more  than  met.  A  negative  value  for  a  slack  variable  represents  an 
infeasible  solution  to  the  original  problem  but  may  have  a  useful  inter¬ 
pretation  for  slightly  modified  problems.  For  example,  to  consider  what 
would  happen  if  additional  units  of  some  resource  became  available  we 
consider  negative  values  for  the  corresponding  slack  variable.  Thus,  if 
xk  measures  labor  units  then  xk  =  -2  would  literally  mean  a  program 
which  exceeded  original  labor  availability  by  two  units  and  thus  (6.2) 
with  t  =  -2  can  be  used  to  describe  how  the  optimal  program  should  be 
altered  to  take  advantage  of  two  added  units  of  labor.  If  xk  measures 
excess  protein  then  t  =  -2  would  indicate  how  to  best  adjust  an  optimal 
diet  to  account  for  a  decision  to  reduce  the  minimum  daily  protein  re¬ 
quirement  by  2  units. 

For  a  slack  index  k  the  permissible  domain  for  t  is  given 
by 

(6.4)  max{bi/aik| ai}t  <0;  i  =  l,...,p}  g  t  g  min  {bi/aik|aik  >  0; 

i  =  1, . .  .,p}  . 

Here  as  in  (6.3)  we  may  have  one  or  both  limits  infinite.  For  any  t 
in  this  interval,  (6.2)  gives  the  best  adjusted  program. 

In  both  (6.3)  and  (6.4)  if  t  reaches  its  extreme  value  we 
have  the  same  result  as  would  be  given  by  a  pivotal  transformation  which 
brings  k  into  the  basic  sequence.  Any  further  changes  in  xk  must 
follow  the  rule  which  we  develop  below  for  changes  in  basic  variables. 

Before  discussing  changes  in  basic  variables  we  consider  the 
effect  of  changes  in  an  initial  resource  or  requirement  whose  corres¬ 
ponding  slack  variable  is  basic. 

If  a  basic  variable  xSh  measures  the  slack  in  resource  h 
then,  clearly,  no  increase  in  the  initial  supply  of  this  resource  can 
effect  the  optimal  program  or  its  cost,  nor  will  a  decrease  by  t  units 
provided  that 

(6.5)  t  S  xfh 

Similar  reasoning  applies  to  requirements.  For  example,  if  the  h-th 
constraint  is  the  minimum  protein  requirement,  then  an  increase  by  t 
in  this  requirement  will  have  no  effect  if  (6.4)  holds. 


(*1) 


[  The  final  tableau  matrix  does  not  indicate  the  amount  of  the 

1  h-th  resource  that  is  used  in  the  program.  Let  M'  denote  the  initial 

tableaux  matrix.  Then  b'>,  is  the  amount  of  resource  h  that  was 

n  S 

available  initially  and  b'^  -  x£^  is  the  amount  used  in  the  optimal 

program.  Similarly,  for  the  vitamin  illustration  b'^  was  the  original 

minimum  daily  requirement  and  bV  +  xG  is  the  amount  of  protein 

n  sh 

actually  supplied  by  the  optimal  diet. 

Next,  let  sh  be  any  basic  index  and  consider  the  effect  of 
changing  Xg^  by  some  positive  or  negative  amount  t  .  According  to 
(6.2)  this  can  be  dene  by  choosing  some  nonbasic  index  k  anc.  letting 
r  =  -a^jj  t  .  In  this  analysis  we  wish  to  preserve  the  original  constraints 
and  hence  require  t  ^  J  . 

For  t  >  0  we  must  choose  k  such  that  a^  <  0  ,  then  the 
unit  cost  for  increasing  xs^  is  -c^/a^  and  hence  the  best  k  to 
introduce  is  given  by 

(6.6)  "ck/ahk  *  min  {~cj/ahj!ahj  <  °»  0eG} 

'  Similarly,  for  t  negative  the  unit  cost  and  best  k  are  given  by 

f 

[  ck/ahk =  min  >  °*  jeG> 

|  Whether  t  is  positive  or  negative  when  t  takes  its  upper 

bound  as  given  by  (6.3)  say  r  =  hs/ask  we  have  for  the  total  increase 
in  cost 

(6.8)  lckbs/askl 

and  for  further  change  in  xs^  we  repeat  the  entire  process  beginning 
with  the  tableau  M*  obtained  from  M  after  pivoting  at  (s,k)  .  We 
must  take  into  account  the  fact  that  M*  is  not  in  optimal  form  but 
this  does  not  effect  (6.2).  The  fact  that  one  or  more  of  the  c*^  may 
be  negative  gives  us  no  difficulty,  provided  that  we  add  the  requirement 
0^  >  0  in  selecting  the  new  k  in  (6.6)  and  (6.7). 

We  have  discussed  the  effects  of  changing  the  original  constant 
terms  b^ 1  .  Changes  in  a  coefficient  a^  for  k  nonbasic  has  no 
effect  on  the  optimal  program  unless  c^  becomes  negative  when  we  multiply 

M1  by  Q  to  get  M  >  Of  course,  the  a^  in  (6.2)  will  have  to  be 

adjusted,  and  if  c^  becomes  negative  further  pivots,  beginning  in  column 
k  will  be  required  to  obtain  the  new  optimal  program.  Changes  in  a^ 
for  k  basic  require  adjustments  in  Q  and  hence  possibly  in  all  of 

M  ;  small  changes  may  not  change  the  optimal  basic  sequence. 


(h2) 


Changes  in  c'^  for  k  nonbasic  are  reflected  by  exactly 
the  same  changes  in  ck  ;  i.e.  if  c'k  is  replaced  by  c’k  +  t  then 
c^  is  also  replaced  by  c^  +  t  .  The  value  t  =  -  :k  is  the  breakeven 
value  and  represents  the  amount  by  which  c'k  must  be  changed  to  make 
the  k-th  activity  competitive  economically  with  the  basic  activities. 

If  cs^  is  replaced  by  cs^  +  t  and  we  subtract  t  times 
row  h  of  M  from  row  zero  to  restore  canonical  form  ck  is  replaced 
by  ck  -  t  ahk  for  each  keG  .  Hence,  if  t  lies  in  the  interval 

(6. 9)  max  {ck/ahk|ahk  <  0,  keG}  <  t  <  min  {ck/ahk|ahk  >  0,  keG} 

the  optimal  basic  sequence  S  and  the  corresponding  optimal  solution 
(6.1)  will  remain  unchanged  except  that  zs  will  be  replaced  by 
z®  -  t  bjj  . 
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I.  FINITE  DIMENSIONAL  PROBLEMS 

* 

Statement  of  the  Basic  Problem 

Let  f :  En  -*•  E*  and  r :  En  -*  Em  be  continuoualy  differentiable 
functions,  and  let  fl  C  En  be  a  subset  of  En.  The  Basic  Problem  can 
be  stated  as  follows: 

Find  a  vector  z  t  En  such  that 
(i)  %  «  0,  r  (z)  =  0, 

(ii)  for  all  z  t  12  with  r  (z)  =  0,  f  (z)  S  f  (a). 

We  shall  call  a  vector  z  satisfying  (i)  and  (ii)  an  optimal  solution  to  the 
Basic  Problem. 

Necessary  Condition  for  Optimality 

The  necessary  condition  to  be  derived  will  be  stated  in  the  form 
of  an  inequality  which  is  valid  for  all  6z  =  (6z^,  6Z2,  .  .  .,  5*  )  in  a 
convex  cone  "approximation"  or  "linearization"  of  the  set  (2.  We  shall 
make  use  of  two  kinds  of  "linearizations"  of  the  set  0  at  a  point  z.  The 
first  one  will  be  defined  now;  the  second  one  will  be  defined  after  the 
proof  of  Theorem  1,  to  obtain  an  extension. 
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Definition.  A  convex  cone*  C(s,  0)  C  En  will  be  called  a 


linearisation  of  the  first  kind  of  tho  constraint  set  ft  at  s  if  for  any 

12k 

finite  collection  {  6c  ,  6s  ,  .  .  . ,  6s  }  of  linearly  independent  vectors 
in  C(s,  0)  there  exists  an  <  >  0,  possibly  depending  on  s(  6*1,  6 s^, 

.  .  6sk,  such  that  co{s,  x  +  «  6s1,  .  .  .,  s  +«  6sk)  C  0. 

If  the  cone  C  (s,  0}  is  a  linearisation  of  the  first  kind,  then  for 
every  6s  <  C  (s,  Si)  there  exists  an  >  0  such  that  s  +  c  6s  «  0  for 
all  «  such  that  0  S  «  S  «^.  The  largest  cone  having  this  property  is 
given  a  special  name. 

Definition.  The  radial  cone  to  the  set  0  at  a  point  s  t  ft  will  be 
denoted  by  RC  (s,  Si)  and  is  defined  by 

RC  (s,  Q)  m  {6s  :  s  +  «  6s  «  0  for  all  «  such  that 
0  S  s  S  €j(S,  6s)  >  Q) 


A  set  C  ie  a  ccne  with  vertex  xq  if  for  every  x  •  C,  x  4  xq, 

xq  +  \(x  -  xq)  «  C  for  all  \  >  0.  Since  the  vertex  xq  of  the  cone 
C  will  normally  be  obvious,  we  shall  omit  mentioning  it. 

t  -  1  k  1 

'  co{s,  s  +  «  6s  ,  .  .  . ,  s  +  «  6s  }  is  the  convex  hull  of  s,  sli  6s , 

]r 

.  .  .,  s  +  «  6s  ,  t.e.,  the  set  of  all  points,  y,  of  the  form 

y«HQS  +  (i1(a+«  8s1)  +  .  .  .  +  Hjj(s  +  «  6s*1),  where 
k 

£  If.  •  1,  m  £  0  for  all  i. 
i-0  1  ^ 
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Whenever  the  radial  cone  RC(z,  ft)  is  a  linearisation  of  the 
first  kind,  it  contains  all  the  other  linearisation  of  the  first  kind  of  the 
set  ft  at  z.  Consequently,  in  the  various  theorems  to  follow,  the 
radial  cone  RC  (z,  ft)  should  always  be  used  if  possible,  since  this  will 
result  in  stronger  necessary  conditions. 

Next,  we  define  the  m,ap  F:  En  -*  Em+* 

F(s)  =  (f(Z),  r  (z) )  . 


We  shall  number  the  components  of  Em+*  from  0  to  m,  i.e.,  y  «  Em+* 
o  1  m 

is  given  by  y  =  (y  ,  y  ,  .  .  .,  y  ).  The  Jacobian  matrix  of  the  map  ■ 

F(z),  (  will  be  denoted  by  --jfjp'  • 

For  the  Basic  Problem  stated  above,  the  following  theorem 
gives  a  necessary  condition  for  optimality. 


Theorem  1.  If  z  is  an  optimal  solution  to  the  Basic  Problem,  and 
C  (z,  ft)  is  a  linearization  of  the  first  kind  of  ft  at  z,  then  there  exists 
a  nonzero  vector  t|i  *  (t|>e,  .  .  . ,  t  Em+*,  with  *J>°  S  0,  such 

that  for  all  6z  «  C(  z,  ft)  (the  closure  of  C  (z,  ft)  in  En) 


<  6*>  S  °- 


Proof.  Let  K(a)  C  Em+^  be  the  cone  defined  by 


K(S)  -  C(s,  ft) 


K  (*)  is  convex  because  C  (z,  fl)  in  convex  and  is  a  linear  map. 

Let  y  =  F(z).  We  shall  non  show  that  the  cone  {y}  +  K(z)  must  be 
separated  from  the  ray 

(3)  R  =  { y ••  y  =  y  +P(-i.  0 . 0)  .  p  *  0}  . 

i.  e.  that  there  must  exist  a  nonzero  vector  ^  *  Em+*  such  that 

(4)  (i)  y  -  y]>  5  0  for  every  y  «  {£}  +  K(z) 

(ii)  <  y  -  y  o  for  every  y  «  R. 

Suppose  that  the  cone  {y}  +  K(z)  and  the  ray  R  are  not 
separated.  Then  the  cor.e  K(z)  must  be  of  dimension  m+1  and  R  must 
be  an  interior  ray  of  {y}  +  K  (z)  (i.  e. ,  all  points  of  R  except  y  are 
interior  points  of  {y}  +K(*)). 

Let  us  now  construct  in  the  cone  {y}  +  K  (z)  a  simplex  X  with 
vertices  y,  y  +  6y*,  y  +  5y*,  .  .  . ,  y  +  6ym+*  such  that 

(i)  there  exists  a  point  y  on  R  (which  we  shall  write  as 

y  *  y  +  6y°,  6y°  *  y(-l,  0,  .  .  0)  with  y  >  0),  dif¬ 

ferent  from  y,  which  lies  in  the  inteior  of  £ , 

(ii)  there  exists  a  set  of  vectors  6x*  t  C  (a,  Q)  satisfying 


(5)  5y*  *  U\  i  =  i . . 

and  surh  that 

(6)  co{z,  t  +  6z*,  .  .  z  +  6zm+*}  C  Q  • 

It  is  possible  to  satisfy  (i)  because  R  is  an  interior  ray  of  the  m+1  dimen- 

(‘0 

sional  cone  {y}  +  K(z),  and  it  is  possible  to  satisfyAbecause  -  C  (I,  Q) 
is  a  linearization  of  the  first  kind.  Note  that  the  vectors  6a*,  i  *  1,  .  .  . , 
m+1  are  linearly  independent  since  the  vectors  6y\  5y^,  .  .  . ,  6ym+* 
are  linearly  independent. 

For  0  <  a  S  1,  let  S*  C  E  be  a  sphere  with  center  y  +  a  6y° 
and  radius  err,  where  r  >  0.  This  sphere  can  always  be  constructed 
because  y  +  6y°  is  an  interior  point  of  E.  For  a  fixed  *,  0  <  o  &  1, 
we  now  construct  a  map  G  from  S  -  {y  +  arSy0}  into  Emn  as 
follows.  For  any  x  «  -  {y  +  a  6y°}  ,  let 

(7)  Ga(x)  *  F(4  +  ZY‘l(e«y°  +  x))  -  (y  +  «6y°) 

where  Y  ie  a  (m+1)  X  (m+1)  matrix  whose  _ith  column  is  6y*,  i  »  1, 

.  .  .,  m+1,  and  Z  is  a  n  X  (m+1)  matrix  whose  Jtth  column  is  6s*.  The 
matrix  Y  is  invertible  because  the  6y*  form  a  basis  for  Em**,  by 
construction. 

Expanding  the  right  hand  side  of  (7)  about  i,  we  get 


-S- 


'-W*' 


(8) 


(x)  *  y  +  ZY'1  (or  6y°  +  x)  -  (y  +  a  6y°) 

+  ©{ZY”l(«  6y°  +x)) 

where  oM  ie  a  continuous  f  ncticn  such  that  lim  « o. 

* .  _  II yll -*0  uyn 

L  »  Y  t  OrtJ  lv?rtce  eg)  i,n,pl.?*e'i  ** 

(9)  C  (x)  *  x  +  oiZY‘J  (e  6y°  +  x)  ) 

or 

Now,  for  x  «  8  (Stf  -  {y  +  o  6y°}  }  (the  boundary  of  the  sphere)  iixii  =  or 
and  we  may  write  x  =  op^,  where  u  p^ii  =  r.  Hence,  for  xt  d[Sg- 
<y  ♦  a  6y°>  ) 


(10)  Ca(«Pi)  *  •Pi  +  o  (o  ZY”1  (6y°  +  ) 

Consequently,  there  exists  an  «*,  0  <  «*  SI,  such  that  for  all 
P!  «  E"1*1.  with  sp^H  «  r, 

(11)  !«(•*  Si  *1  (6y°  ♦  Pl))  |  <  o*  r 

We  now  conclude  from  Brouwer's  Fixed  Point  Theorem 

that  there  exists  ax«S,<-{y  +  o*  6y°}  such  that 

e 

(12)  G  ,  (5)  »  0. 

df,  »  -  o(*c<  f,))  *  *+"<*  *(,  - 
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i*  | 


F<*+Zr‘V  Sy°+*|)  =  J  +  «*  6y° 


Now  J  +  .*  Jy°  ,  col  (f(J)  .  o.  0 . 0),  wh.r.  y  >  0. 


Thu*. 


expanding  (13), 


r(i  +  ZY'^o*  6y°  t?))  »  0 


on  <ii  t  zy-1  (.*  6y<> ,  .•  Y  <  f(J) 

rurthormor..  b.c»u..  o 1  (6)  ttd  to.  /.«  th*«  for  tty  «y  to  to. 

I  -  (?)  ,  to.  wetor  .  .  a  ♦  ZY’1  «y  bdoog.  to  co{i,  i  +  „* 

f  *  S.m+1> .  . 


t  ♦  HY'1  («*  iy°  +  ft )  ,  Q 

Httc.  ft  1.  oot  opttou.  which  1.  .  cowrttlclloo.  W.  to.,*.,.  con. 
elwl.  toot  to.  ott.  <}}  ♦  K(i)  ud  to.  r„  R  mwtb.  ..^wtod,  1. , 
there  muet  exiet  e  aonsero  vector  *  «  *•**  euft|,  ^ 


{i)  K  <r  -  r)>  *  o  for  every  y  «  ft)  ♦  K{t) 


•  /  — 


and 


(18)  (ii)  <4-,  (y  -  y)^>  -  0  for  every  y  «  R 

Substituting  (2)  in  (17),  we  have 

(W  <  *.  6z^>  £  0  for  every  6z  «  C  (&,  ft) 

Clearly,  (19)  must  also  hold  for  every  6z  c  C  (ft,  ft)  . 

Substituting  for  y  from  (3)  in  (18),  we  have 

(20)  <4/,  (-1,  0 . 0)>  =  -4;°  a  0  . 

This  completes  the  proof. 

2 

It  has  been  pointed  out  by  Neustadt  [  J  that  Theorem  4  remains 
valid  under  the  relaxed  assumption  that  C(z,  ft)  is  a  linearization  of  the 
second  kind  of  (1  at  z,  defined  as  follows. 

Definition.  A  convex  cone  C(z,  ft)  C  En  will  he  called  a  linearisation 

of  the  second  kind  of  the  constraint  set  n  at  z,  if,  for  any  finite  collec- 
12k 

tion  {6z  ,  6z  ,  ,  .  . ,  6z  }  of  linearly  independent  vectors  in  C(s,  ft), 

1  k 

there  exists  an  «  >  0,  possibly  depending  on  s,  8a  ,  .  .  ,,  8a  ,  and  a 

1  k 

continuous  map  ;  from  co{  z,  a  +  «  6s  ,  .  .  a  + «  8z  }  into  ft,  such 
that  £(s  +  8a)  ■  a  +  6a  +  o(8z),  where  lim  llll J1  ■  o. 

||  6s  ||  ■*  0  II 8*  I 
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Remark.  We  observe  that  if  C  (z,  ft)  is  a  linearization  of  the  firat  kind 


of  ft  at  z,  then  it  is  also  a  linearization  of  the  second  kind  of  ft  at  z, 
with  the  map  J  being  the  identity.  Thus,  unless  we  have  specific  cause 
to  indicate  whether  a  cone  C(z,  ft)  is  a  linearization  of  the  first  or  second 
kind,  we  shall  refer  to  it  simply  as  a  linearization  of  ft  at  z.  We  now 
restate  Theorem  1  in  this  form. 

FUNDAMENTAL  THEOREM 

If  z  is  an  optimal  solution  to  the  basic  problem  and 
C(£,  ft)  is  a  linearization  of  ft  at  z,  then  there  exicts  a  nonzero  vector 
*j,  =  (vj,°,  *m)  *  Em+~  with  tj,0  S  0,  such  that  for  all  6z  <  C  (I,  ft), 

(the  closure  of  C  (z,  ft)  in  E*)  j  /  ty,  -•  6z  ^  SO. 

The  reader  may  easily  modii'v  the  proof  of  Theorem  1  so  as  to  apply  to 
Theorem  1*.  Finally,  it  should  be  pointed  out  that  all  conditions  such 
as  continuity  differentiability,  etc.,  imposed  on  the  various  functions 
need  only  hold  in  a  neighborhood  of  the  optimal  point. 


We  shall  now  show  how  a  number  of  classical  optimization  prob¬ 
lems  can  be  cast  in  the  form  of  the  Basic  Problem,  and  we  shall  then 
apply  Theorem  1  or  Theorem  1*  to  rederive  several  classical  conditions 
for  optimality,  as  well  as  to  obtain  some  new  ones. 
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Classical  Theory  of  Lagrange  Multiplier 

The  classical  cone  trained  minimization  problem  admits  equality 
constraints  only.  Thus,  it  is  the  3asic  Problem  with  0  *  En,  the  entire 
epace.  Clearly,  En  is  a  linearisation  of  the  first  kind  for  En  at  any 
point  s  «  En. 

Thus,  we  conclude  from  Theorem  1  that  if  c  is  an  optimal 
solution  of  the  Basic  Problem,  with  0  *  En,  then  there  exists  a  nonzero 
▼ector  tjj  «  such  that 

(21)  <  4*.  Sz  )  SO  for  all  6z  t  En 

This  may  be  rewritten  as 


(22)  t|f,  bt'y  S  0  for  all  6z  «  En 


Since  for  any  6s  «  En,  -6s  is  also  in  En,  we  conclude  from  (22)  that 
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(24) 


m 

4»°  Vf(z)  +  ^  *}»'  Vr^z)  =  0 


We  have  thus  reproved  the  following  classical  result. 


12m 

Theorem  2.  Let  /(•),  r  (•),  r  (•),  .  .  r  (• )  be  real  valued,  con¬ 
tinuously  differentiable  functions  cn  En.  If  z  €  En  minimizes  f(z)  sub¬ 
ject  to  the  constraints  r*(z)  =  0,  i  =  1,  2,  .  .  m,  then  there  exist 

o  1  m 

scalar  multipliers,  ^  ,  4<  ,  .  .  . ,  *)>  ,  not  all  zero,  such  that  the 
function  H  on  En  which  they  define  by 


rn 

H(z)  =  f(z)  +  ^  4il  rl(z) 


has  a  stationary  point  at  z  =  z,  i.  ,  (24)  is  satisfied. 

It  is  usual  to  assume  that  the  gradient  vectors  Vr*(z),  i  *  1, 

2,  .  .  . ,  m,  are  linearly  irrepi  i'.  U;nt  for  all  s  such  that  (z)  a  0.  This 
m 

precludes  ^  Vr*(j)  =  0  and  hence  in  (24)  t}>0  i  0.  Multiplying  (24) 
i=l 

by  1  /4»°  *nd  letting  i  =  1,  2,  .  .  . ,  m,  we  now  deduce  the 

more  commonly  seen  condition. 


Theorem  2’.  If  z  minimizes  f(z'  subject  to  r(z)  *  0,  and  the  gradients 

Vr^(z),  t  8  1,  2 . .  are  linearly  independent,  then  there  exists 

a  vector  X,  «  Em  such  that  the  Lagrangian  L  on  £n  X  Em,  defined  by 


(26) 


m 

L(s,  X)  -  f<*}  +  ^  X1  /(*) 
•  1=1 


has  a  stationary  point  at  (s,  \). 

W*  not*  that  by  (24)  —  *  0  and  that  *  r(s)  =  0, 

by  assumption. 


Nonlinear  Programming 

Lot  £ :  E11  -*  E1,  r  :  En  «*  Em,  and  q :  En  •*  Ek  be  continuously 
differentiable  functions.  The  standard  Nonlinear  Programming  Problem 
is  that  of  minimising  f  (*)  subject  to  the  constrains  that  r(s)  «  0  and 
q(s)  at  0. 

This  corresponds  to  the  special  case  of  the  Basic  Problem,  with 
0  *  {«  •  q(x)  SO}.  We  shall  now  show  how  Theorem  1  can  be  used  to 
obtain  various  commonly  known  necessary  conditions  for  k  to  be 
optlaal. 


Given  a  particular  point  s  «  ft,  we  shall  often  have  occasion  to 
divide  the  components  of  the  inequality  constraints  functions,  q1, 

1  •  1,  ....  k,  into  two  sets;  those  for  which  q^s)  ■  0  and  those  for 
which  q*(s)  <  0.  To  simplify  notation  we  introduce  the  following 
definition. 
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Definition.  For  a  <  ft,  let  the  index  eet  I(s)  be  defined  by 


(27)  I(*)  *  {i :  q1(*>  =  0} 

The  constraints  q\  i  <  1(a)  will  be  called  the  active  constraints  at  a. 
We  shall  denote  by  TJz)  the  complement  of  I(z)  in  {1,  ....  k}  . 

The  set  ft  «  {z  ;  q(z)  £  0}  introduced  above  is  assumed  to 
satisfy  the  following  condition: 

♦  a  si 

Assumption  (Al).  1  Let  s  «  <1  be  an  optimal  solution^the  Nonlinear 
Programming  Problem.  Then,  there  exists  a  vector  h  «  En  such  that 

<  V^(*).  h>  <  0  for  all  i  «  I(J) 

A  sufficient  condition  for  (Al)  to  bu  satisfied  is  that  the  vectors  Vq*(s), 
i  «  l(s)  be  linearly  independent  (see  Corollary  to  Lemma  3). 


Definition:  For  any  tiQ,  the  internal  cone  of  ft  at  a,  denoted  by 

IC(z,  ft),  is  defined  by 


IC(a,  ft)  «  {5a  *  <  V^(*),  6s  >  <  0  for  all  i  «  I(a)> 


'  When  some  of  the  functions  q*,  i 
require  that  there  exist  a  vector 
for  these  functions  and  <  Vq*(a), 
q»,  i  «  1(a). 


<  1(a),  are  linear,  it  suffices  to 
h  <  En  such  that  <  Vq*(s),  h  ^  *  0 
h  >  <0  for  the  remaining  functions 


By  assumption  (Al),  the  convex  cone  IC(z,  ft)  is  nonempty.  It  is  s 
simple  exercise  in  the  use  of  Taylor's  Theorem  to  prove  the  following 
lemma. 

Lemma  1.  If  IC(s,  ft)  i  the  empty  set,  then 

(i)  lC{s,  ft)  is  a  linearisation  of  the  first  kind  of  ft  at  s, 

<ii)  16(s.  ft)  »  {6z :  <  7^(s),  6z>  a  0  for  all  i « I(s)> 

When  specialised  to  the  Nonlinear  Programming  Problem,  Theorem  T 
assumes  the  following  form. 

Theorem  3.  If  a  is  an  optimal  solution  to  the  Nonlinear  Programming 
Problem,  with  (Al)  satisfied,  then  there  exists  a  nonzero  vector  c  Em+1, 
with  t|»°  S  0,  such  that  for  all  6s  t  ft)  •  {6s  *  <7qV*)»  6s^  *  0 
for  all  i  «  1(a))  , 


S  0 


where  H(s)  «  ^  i(s) 


•I 

i*l 


r^s) 


Using  Theorem  3  and  Parkas  Lemma  we  obtain  the 

following  necessary  condition  for  optimality,  Which  is  in  a  form  more 
familiar  to  specialists  in  mathematical  programming. 
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Theorem  4.  If  z  is  an  optimal  solution  to  the  Nonlinear  Programming 
Problem,  with  (Al)  satisfied,  then  there  exist  a  nonzero  vector 
«  Em+\  with  4^°  *  0,  and  a  vector  p  *  E^,  with  |t  i  0(  such  that 


m  k 

(i)  4-°  Vf(z)  +  ^  ^  Vri(z)  +  ^  p  Vq1^)  =  0 

i=l  i=l 

and 

k 

(ii)  £  ji1  qi{z)  =  0 
i=l 


Proof.  From  Theorem  3, 

<i#i.  «•>  so 

for  all  6z  such  that  ^Vq^z),  6z  )  5  0,  i  «  1(a). 

i  a 

By  Farkas  Lemma,  there  exist  scalars  p  S  0,  l  «  1(a)  such  that 

^  *1  *‘<S)  -  0 

UI(z) 


Let  ii  ■  0  for  i  «  I(S).  This  co-n  .leus  ihe  pi-oof. 

Most  of  the  other  well  -  vtr.own  necessary  conditions  for  Ton' 
linear  Programming  Problems  can  be  obtained  from  Theorem  4  by 
making  additional  assumptions  on  the  functions  r  and  q.  For  example, 
the  following  corollaries  to  Theorem  4  are  immediate  consequences  of 
that  theorem. 
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Corollary  1.  If  assumption  (AJL)  is  satisfied  and  the  vectors  Vr*(»), 
i  a  1,  .  .  . ,  m,  are  linearly  independent,  then  there  exist  vectors 
tj,  «  Em+*,  (*  *  which  satisfy  the  conditions  of  Theorem  4  and  such 
that  (4»°,  p)  i  0. 

Corollary  2.  If  Vr*(s),  i  *  1 . .  together  with  7q*(*), 

i  «  1(a),  are  linearly  independent  vectors,  there  exists  a  vector 
i|i  <  E1*1^  satisfying  the  conditions  of  Theorem  4  with  <Ji°  <  0. 

The  assumption  in  Corollary  2  is  a  well-known  [ii]  sufficient 
condition  for  the  Kuhn-Tucker  constraint  qualification  to  be  satisfied. 
When  it  is  added  to  Theorem  4  we  obtain  a  slightly  restricted  form^ 
of  the  Kuhn- Tucker  Theorem  [4]> 

Corollary  3.  If  there  exists  a  vector  h  «  En  such  that  <.Vq*(a),  h) 
for  all  i  «  I(s),  <  Vr4(s),  •  0  for  i  *  1,  ....  m,  and  the  vectors 

7r*(s),  i  ■  1  ,  .  .  . ,  m,  are  linearly  independent,  then  there  exists  a 
vector  ^  satisfying  the  conditions  of  Theorem  4  with  4»°  <  0. 

The  assumption  in  this  corollary  is  a  sufficient  condition  for 
the  weakened  constraint  qualification  [|3J  to  be  satisfied.  Augmented 
by  this  assumption.  Theorem  4  becomes  a  slightly  restricted  form^  of 
the  Kuhn-Tucker  Theorem  with  the  weakened  constraint  qualification  , 


T 


In  practise,  the  Kuhn-Tucker  constraint  conditions  can  rarely  be 
shown  tc  be  satisfied  unless  the  restrictions  imposed  in  Corollaries 
2  and  2  ho  d. 


Ill  THE  MAXIMUM  PRINCIPLE 


To  illustrate  the  applicability  of  the  theory  *uat  developed,  ve 
shall  use  it  to  obtain  the  Pontryagin  Manlaua  Principle  for  optimal  control 
problems. 

Consider  a  dynamical  system  described  by  the  differential  equation: 

1.  dx  ..  . 

-  h(x,u) 

for  all  t  in  the  compact  interval 
the  state  of  the  system  at  time  t, 

of  the  system  at  time  t,  and  ia 

.  _n 

range  in  E 

The  Fixed  Tl—  Ogyfifl  Control  Problem  ia  that  of  finding  a  control 
Q(t)  ,  t  c  I  ,  and  a  corresponding  trajectory  *(t)  ,  determined  by  (  )  , 
euch  that 


I  •  [t^.tj]  ,  where  x(t)  c  E°  la 
u(t)  c  E*  is  the  input  or  control 
«;  function  defined  an  En  x  ^  with 


2.  for  t  c  I  ,  0(t)  is  a  measurable,  essentially 

bounded  function 
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whose  range  is  contained  in  an  arbitrary  but  fixed  subset  U  of  Em; 

x(t^)  =  where  x^,  a  fixed  vector  in  En,  is  the  given 
initial  condition; 

4.  x(t2)feX^,  where  X^  =  {xtEnjg(x)  =  0},  and  g  maps  En  into 

E*  {X^  is  the  fixed  target  set)  ; 


for  every  control  u(t),  and  corresponding  trajectory 

x(t),  satisfying  the  conditions  {2  ),  (  3  ),  and  (4  ), 


2  0  0  . 

{<x(t),u(t))dt£  f  (x(t),  u(t))dt 


where  f  (.2.)  la  a  cost  function  mapping  En  x  E®  into  EP 


We  make  the  following  assumptions: 

6.  "^lie  functions  h<«  j*)  and  are  continuous  in  both  x  and 

u,  and  are  continuously  differentiable  in  x^ 

7.  The  function  g(»1  is  continuously  differentiable  and  the  corresponding 

-  >  8k{x) 

Jacobian  matrix  -f —  is  of  maximum  rank  for  every  x  in  X. . 

ox  1  2 

To  transcribe  the  control  problem  into  the  form  of  the  Basic 
Problem  we  require  the  following  definitions: 
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14.  where  _X^  =  {etE***  j  g(P2z)  “  »  where  8  maps 

£n  into  E*  ; 

15.  for  every  control  .u(t),  with  tfl,  and  corresponding  trajectory 

» 

*(*)#  eatisfying  (u)  and  the  conditions  (12),  (H),  and  (1A )  above, 
pi*(t2) 

Finally,  we  complete  the  tranacription  of  the  optimal  control 
problem  Into  the  form  of  the  Basic  Problem  by  defining 

w.  4  (*>  = 

17.  r(z)  =  g(P2*(t2)), 

by 

IB.  and  we-let4I2^>e  the  set  of  all  absolutely  continuous  functions  z 
from  I  into  E^*  which,  for  some  measurable,  essentially  bounded 
function  u  from  I  into  U  C  E*”,  satisfy  the  differential  eqi  ation  (u) 
for  almost  all  t  in  I,  with  z(tj)  =  (0,  xQ) . 

IS.  Remark ;  It  is  clear  that  with  $ ,  r,  and  i2  defined  as  in  (16),  (17 ), 

and  (18),  respectively,  we  have  trarscr'bed  the  optimal  control  problem 

*  * 

*  n'  “ 

into  the  form  of  the  Basic  Problem  \  > .  We  shall  call  the  transcribed 

a 

.  optimal  control  "the  optimal  control  problem  ir.  standard  form.1* 

We  still  have  not  defined  the  linear  topological  vector  space  «£  . 
From  (18)  it  is  clear  that  ft  is  a  subset  of  the  linear  space  of  all 
absolutely  continuous  functions  from  I  into  E***.  However,  since  wc 
wish  to  use  a  linearization  constructed  first  by  Pontryagin  et  al.  [  '  , 

wc  find  it  necessary  to  imbed  ft  into  a  larger  linear  topological 
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apace  which  we  define  below. 

Let  It  be  the  set  of  all  upper  semi 'Continuous  real  valued 


functions  defined  on  I,  and  let 


jl  -  U-U. 


From  the  properties  of 


upper  arid  lower  semi-continuous  functions  it  follows  that 

jS  is  a  linear  vector  space.  We  then  define  Si  to  be  the  Cartesian 
product  with  the  polntwise  topology,  [15]  i.  e. , 

the  topology  which  is  constructed  from  the  sub-base  consisting  of  the 
family  of  all  subsets  of  the  form  {f  :f(t)eN},  where  t  is  a  point 
in  land  N  is  an  open  set  in  E**\ 


It  is  easy  to  show  that  4  and  r,  as  respectively  defined  by  (16) 
and  (17),  are  continuous*  . ^  ^  * 

Let  z  (t),  corresponding  to  the  control  u  (t),  be  ^  solution  to 


the  optimal  control  problem  in  standard  form  (1 9j.  We  now  proceed  to 
construct  a  linearization  for  the  constraint  set  0  at  z . 

Let  lxC  I  be  the  set  of  all  points  t  at  which  u(t)  is  regular, 

20.  1,  «  Ct|.  <  t  <  t2,  Hm  ‘  '•  ,0r  *V*ry 

measJT)-*!)  ' 

neighborhood  N  of  u(t),  ttTC  I}* 

m 

Definition:  A  real  valued  function  f : E*-* E*  is  called  upper  semi- 
.  ■  • 

continuous  at  a  point  t.  In  e\  if  lim  supf(t)  <  f(tQ).  And  it  is  called 

t-*t- 

lower  semi-continuous  if  -f  is  upper  semi-continuous  [  ]. 
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and 


®g(P  ,£(t -)) 

25.  r*(z)(5z)  =  - ^ -  p2«*(t2). 


Therefore,  from  Theorem  (II. ©there .exist  a  vector  p  in 
and  a  vector  in  E*  such  that 

p*  <  0  for  i  =  l,  2,  •  •  • ,  p; 

27‘  (p.tj)  i  0; 

8g(P J(t  ))  - - 

28.  (H.Pjfizfy)  +  <ri.  - P26z^2»  <  0  for  all  62  S  C(z,0), 

»— W  '  *  — 

Since  every  6z  (t)K  defined  in  (22),  is  in  C(z ,0),  (28)  implies 
*,v 

that 


<H.P1*(t2.«)Ik(£(s),v)  -  J»(J(s),G(.))]>  + 


»g(P2^(t2))  .  . 

♦  (n.  - -  P2*(t2,*)[lt(z(«),v)  -  Mz(s),u(s))p  <  0 


for  every  1(1,  and  vtU. 
Hence, 


;  T  f  T  T  «gT(P,MtJ) 

(*  (t2,t)^p|p  +  p*  - - — --  nj  *  k*(*(t).v)-b(S(t).  u(t))>  <  o 

for  every  t  c  Ir  and  v  c  U.  Let  4(.)  .  (4°(.),  4'(.) . 4n(.)) 

lnto-E0**  defined  by 
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T  T  TP®  lPZ*W\ 

+  W  «.•  (**.»)(?!  H  +  P2|— Tx - -  rx). 


t«a. lor  almost  all  t  in  I,  «Ji(*)  satisfies  the  differential  equation: 


Combining  (30)  and  (31  )<  we  obtain 

33.  (»|/(t),  li(z(t),u(t)))  s  Maximum  {(4*(t).  M*(0»  v))  |  vtU}  for  t£l^. 

34.  Since  meas  (1^)  =  meas  (I),  (33)  holds  for  almost  all  t  in  I. 

34.  7  «g(p2*(t2)) 

Remark:  By  assumption  (see(J^)),  ■  ■  R  -  is  of  maximum  rank, 

— ■  —  ■ o  x 

and  since  Ch»  “H)  4  0,  4K*)  *  solution  to  {fl  )  is  not  identically  zero. 

•  Thus,  we  have  proved  the  following  theorem,  which  we  state  in 

terns  of  the  original  quantities  defining  the  optimal  control  problem, 
and  in  which  we  shall  substitute  (p°,p),  with  p  e  En  #  above. 
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II.  INFINITE  DIMENSIONAL  PROBLEMS 


We  shall  now  show  how  the  Fundamental  Theorem  presenter  .‘art  I 
can  be  extended  to  problems  In  Infinite  dimensional  spaces.  As  an 
application,  we  shall  use  our  extension  of  the  Fundamental  Theorem  to 
derive  the  Pentrygin  Maximum  Principle  [2] ,  for  fixed  time  optimal  control 
problems. 

First  let  us  formulate  the  equivalent  of  the  Basic  Problem  in  an 
infinite  dimensional  space. 

1.  BASIC  PROBLEM;  Let  L  be  a  linear  topological  space. 

Given  a  function  f(.)  mapping  L  into  the  reals,  a  function  (.) 
mapping  L  into  E®  and  a  subset  QcL,  find  a  vector  X  e  Q  such  that  e(X)“  0 
and  such  that  for  all  X  c  Q  satisfying  r(X)  -  0, 

2,  f(X)  <  f (X) 

* 

We  shall  call  any  X  with  the  above  properties  an  optimal  solution. 

The  only  difference  between  the  formulations  in  the  proceeding 
section  and  (1)  above  is  that  before  we  specified  that  the  functions  f 
and  r  are  differentiable,  which  we  do  not  do  in  (1),  since  differentiability 
is  not  a  well  defined  concept  in  a  general  linear  topological  space. 

However,  to  obtain  an  extension  of  the  Fundamentsl  Theorem  we  need 
a  linear  function  from  L  into  E**1  to  take  the  place  of  the  Jacobian  matrix 
in  (1 . 1)  and  a  suitable  continuous  function  from  L  into  to  take 
the  place  of  the  function  0(.)  in  (1.9).  Since  we  can  no  longer  ensure 
the  existence  of  such  functions  simply  by  requiring  that  f(.)  and  r(.)  be 
differentiable,  we  take  care  of  this  situation  by  incorporating  the  require ! 
functions  into  the  definition  of  a  conical  approximation.  As  we  shall  see 
later,  this  is  not  a  restrictive  practice. 


3.  DEPICTION :  A  convex  cone  C(X,Q)  C  L  will  be  called  a  conical 
•Bgroxlation-to  the  set  fl  atj_e  0.  with  reject  to  the  man  F  A  fV)  lf 
there  exiete  e  linear  functior.  t  ( i)(.)  fro.  L  into  E**1  .uch  that  for 
ny  finite  collection  {5  x^,  d  Xj,...,^  x^}  of  linearly  independent  vectors 
in  C(*,n)  there  exists  an  E  > 0  a  continuous  nap  6(.)  fro.  co{*,  m&Ed*, 

^E4*2 . **E6  V  int0  n*  •**  *  continuous  map  o(.)  from  L  into  E0*1, 

with  E,  6,  and  o  possibly  depending  on  *,6x,  dx2...,  6x(k)  which  satisfy 


4. 


llB  1 1  1 1 

E-*o  E 


o 


uniformly,  for  all  yeco  {i,  t+dXj,...,  *+d  xk},  and, 

5.  F(6(x))  -  K«+P'(1)M)h(x-*) 

for  all 

*  c  co  {*,  a+dxj,...,*  +d 

Me  are  now  ready  to  extend  theorem  (2.3.9). 


— ^>RPj:  If  *  U  •“  ®pt lmal  solution  to  the  Barre  problem  (.)  and 

c(t.O)  i.  «  conical  approximation  to  0  at  *  c  0  with  respect  to  the  map  F-(f.r). 
then  there  exists  a  nonr.ro  vector  *  -  (*°,  f1 . *■)  in  -lth  #o<o> 

each  that  <  4.  F*  <»(««)>  <  0  for  all  6x£c(tTQ7).  where  cTTo)  u  the 

closure  of  c<*,fl)  in  L. 
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large-scale  System  Optimisation:  A  Review 
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George  B.  Dantslg 


mtheaatlcal  programing  is  a  generic  term  for  the  related  fields  of  Linear 
Programing,  network  Flow  Theory,  Integer  Programing,  Convex  end  Son-Linear  Pro¬ 
graming,  and  Programing  under  Uto certainty.  Its  research  has  problems,  parti¬ 
cularly  those  problems  where  random  events  and  decision  events  occur  alternately 
In  successive  stages.  In  problems  where  such  v  icertainty  occurs,  what  is  usually 
dene  in  formulating  is  to  replace  the  uncertain  elements  with  their  expected  va¬ 
lues  (with  possible  an  added  safety  factor).  It  is  well  known  that  a  plan  based 
on  expected  values  of  its  coefficients  sad  constraints  can  lead  to  answers  that 
are  not  correct.  Although  the  use  of  expected  value  does  not  lead  to  the  best  an¬ 
swer,  it  Is  entirely  possible  that  it  could  lead  to  excellent  plans  indistinguish¬ 
able  from  the  optimum  la  the  run-of-the-mill  application.  When  one  considers  in¬ 
stead,  a  direct  attack  on  uncertainty  via  mathematical  programing,  It  Inevitably 
leads  to  the  consideration  of  large-scale  systems.  These,  because  of  their  struc- 
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tore,  have  proven  difficult  of  solution  so  far,  but,  I  believe,  of  Intensive  In¬ 
vestigation  In  the  future. 

Mathematical  programming  is  a  ten  invented  by  Robert  Dorfman  of  Harvard  a- 
rouad  1950.  He  felt  that  at  that,  time,  the  fundamentals  of  linear  programming 
were  well  enough  known  that  the  wavc-of-the-future  lay  in  the  extension  of  the 
methods  of  linear  programing  Into  the  non-linear  programing  field.  Certainly 
we  today,  15  years  later,  feel  this  is  true,  m  the  Calculus,  the  derivative 
(or  first  order  approximation)  plays  a  key  rols.  Applied  to  non-linear  Inequal¬ 
ity  system,  it  leads  to  approximation  by  linear  inequality  system.  This  is  one 
way  which  these  extensions  have  taken  place,  and  illustrates  why  the  various  fields 
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comprised  under  mathematical  programing  are  related.  Sere  are  bom  other  ways: 

One  attempts  to  extend  the  concept  of  duality  to  nonlinear  system.  Baring 
done  me,  one  trie*  to  combine  the  ootiblnatosrlal  power  of  linear  program  with  the 
elassloal  steepest  descent  processes  to  solve  non-linear  program. 

One  attempts,  as  we  bare  Just  noted,  to  reduce  problem  uncertainty 

to  equivalent  deterministic  system  and  to  large-scale  system  with  special  struc¬ 
ture. 

One  tries  to  solve  an  Integer  progran  by  replacing  It  with  an  equivalent  lin¬ 
ear  progran;  that  is  to  say,  by  cleverly  building  up  a  set  of  linear  inequalities 
that  are  both  necessary  and  sufficient. 

In  all  of  these  developments,  one  characteristic  stands  out;  nsmly  in  one 
way  or  another,  techniques  for  solving  large-scale  system  play  a  dominant  role. 

Accordingly,  let  us  look  first  at  direct  methods  for  handling  large  problem. 
Around  195^  or  so,  under  the  auspices  of  The  RAND  Corporation,  William  Orchard- 
Bays  produced  the  first  truly  commercial  linear  programming  code.  It  had  many  fea¬ 
tures  that  helped  amateurs  to  get  their  problem  on  the  machine  with  a  reasonable 
chance  of  getting  an  answer.  Today,  the  building  of  a  linear  programming  code 
(complete  with  all  the  special  features)  is  a  major  undertaking  which  Is  expen¬ 
sive  to  produce  and  to  maintain. 

As  applications  grow,  there  has  been  an  Increasing  rtfimnl  to  truly  e- 

oormous  system.  The  Russian,  Kantorovich,  in  his  1939  pamphlet,  envisioned  such 
a  possibility.  Already,  linear  programing  models  of  industrial  system  have  been 
solved  with  more  than  10^  variables  and  10*'  equations.  These  models,  of  course, 
do  not  have  general  matrix  structure  and  It  is  not  likely  that  may  instance  of  a 
large  practical  problem  will  ever  have  general  structure.  The  reason  is  obvious. 
JUst  Imagine  the  physical  task  aloes  of  finding  all  the  coefficients  for  a  thou¬ 
sand  by  ten  thousand  general  linear  program  (there  could  be  a  high  as  10^  non- 
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zero  coefficients). 

Fortunately,  large-scale  practical  node  Is  tend  to  have  a  low  percentage  of 
non-zero  coefficients;  in  fact,  under  ft,  sene tines  under  l£.  Orchard-Bays '  first 
code  exploited  this  characteristic  by  asking  use  of  a  "pricing  rector".  This  aade 
Inexpensive  the  selection  of  the  pivot  coltmm  directly  from  the  original 'ndn-dense 
data.  The  pivot  coluan  here  refers  to  one  of  the  steps  in  the  eiaplex  method  for 
solving  linear  programs. 

As  systems  have  grown  in  size,  every  advantage  has  also  been  taken  of  the 
characteristics  of  the  improved  computers.  It  has  been  discovered  recently  that 
the  size  of  the  inverse  representation  of  the  basis  in  the  simplex  method  could 
have  an  important  effect  on  running  tine.  Therefore,  compact-inverse  schemes  a- 
long  the  lines  first  proposed  by  Barry  Markowitz  of  RAHD  have  become  increasingly 
Important.  Recently,  two  groups  working  Independently,  developed  this  approach 
with  astounding  results.  For  example,  the  Standard  Oil  Company  of  California 
group  reports  running-time  on  some  of  their  typical  large  problems  cut  to  l/4. 

How  to  find  the  most  compact  Inverse  representation  of  a  sparse  matrix  is 
still  an  unsolved  problem; 

Conjecture :  If  a  non-singular  matrix  hae  K  non-zero  elements,  it  is 
always  possible  to  represent  them  as  a  product  of  elemsn- 
tary  matrices  such  that  the  total  number  of  non-«sro  en¬ 
tries  (excluding  their  diagonal  unit  elements)  is  at 
most  K  .  [incidentally,  the  inplrloal  schemas  Just 
mentioned  often  have  no  more  than  K  ♦  1O0IC  non-zeros 
in  the  Inverse  representation.  I 

Dynamic  structures  are  interesting  in  themselves,  and  could  have  lxportant 
applications.  One  such  is  the  linear  control  processes  proposed  by  ftatryngln. 
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(I  will  speak  of  his  problem  later  In  connection  with  the  decomposition  prin¬ 
ciple.)  As  early  as  1954,  I  published  a  paper  on  how  to  compact  the  Inverse 
representation  of  "he  basis  with  a  staircase  structure  (see  figure).  Again  in 
1962,  I  discussed  mother  method  which  permitted  one  to  find  a  compact  Inverse 
and  then  efficiently  Maintain  till 3  compactness  in  moving  from  one  iteration  to 
the  next.  There  have  been  several  other  proposals,  all  excellent,  that  seek 
to  apply  the  sin  nlex  method  to  the  full  system  by  compacting  the  Inverse.  As 
far  as  I  know,  aone  of  these  direct  proposals  have  been  realised  in  computer 
codes. 
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large-scale  systems  have  been  attacked  indirectly  by  means  of  the  decompo¬ 
sition  principle.  Several  codes  have  been  written,  and  sooe  of  the  recent  ex¬ 
periences  have  been  encouraging.  J.  7.  Benders  in  his  tbeois  "Partitioning  in 
Mathematical  Programming  I960",  developed  the  dual  of  the  decomposition  prin¬ 
ciple,  and  shows  how  this  approach  can  be  u**d  to  deal  with  the  mixed-integer 

programming  problem.  Rosen  and  Beale  have  each  proposed  parti  tiooing  methods 
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for  solving  systems  whose  atructurea  fit  into  the  from work  of  a  econom  horl* 
sontal  and/or  vertical  border  vhlle  the  reminder  la  a  diagonal  net  of  Indepen¬ 
dent  blocks. 


Purely  combinatorial  problem  fora  an  important  division  of  aathaaatloal 
programing.  They  fall  briefly  Into  two  categoric  a.  The  flret  ore  thoee  prob- 
lean  whoee  atructurea  are  apeelal  —  like  the  transportation  problem  —  or  the 
aial—  amber  of  ares  which  "cover"  nodes  in  a  network  (graph).  Far  them, 
■pedal  mthods  hare  been  sought. 

One  of  the  aoct  tantalising  problems  of  this  type  baa  been  the  travelling 

ea  lemon  problem .  it  la  so  eloet  to  a  network-flow  typo  problem  that  one  mold 

hope  to  find  aaae  easy  representation  of  the  faces  of  its  polyhedral  eolation 

aat.  So  far,  now  baa  been  discovered.  There  la  also  a  close  relation  between 
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covering  problems  and  the  famous  four-color  problem.  The  other  approach  to  com¬ 
binatorial  problems  is  through  integer  programming,  mis  was  first  used  in  195^ 
to  solve  a  particular  large-scale  travelling  salesman  problem  by  Fulkerson, 
Johnson,  and  myself.  In  195®,  Gomory  laid  the  foundations  of  this  field  by 
showing  how  to  systematically  determine  a  necessary  and  sufficient  system  of 
linear  inequalities. 

The  inter-relation  between  large-scale  system  methods  and  integer  pro¬ 
gramming  was  brought  out  in  a  recent  paper  of  Goswry  entitled  "large  and  Ron- 
convex  Problems  in  Linear  Programming".  Here,  Gomory  reviews  in  a  unified  man¬ 
ner,  how  the  ideas  of  Integer  programming  and  those  of  the  decomposition  prin¬ 
ciple  can  be  combined  to  solve  many  important  applications  such  as  the  paper- 
trim  problem,  multi-commodity  flows  in  networks,  programming  of  e concede  lot 
sizes,  etc. 

Integer  programaing  methods  are  being  experimented  with  in  a  number  of 
places.  It  seems  likely  that  we  are  nearing  a  threshold,  and  that  we  will  soon 
see  some  excellent  coHwrelal  codes  produced  and  used  successfully  for  certain 
problems. 

I  will  no,%,  in  this  presentation,  describe  the  developments  in  non-linear 
programing.  Rather,  I  have  chosen  to  illustrate  the  power  of  certain  non¬ 
linear  programming  ideas,  such  as  the  generalized  linear  program  of  Wolfe  to 
an  interesting  problem  in  linear  control  theory. 

But  first,  I  would  like  to  review  the  concept  of  a  Generalized  Program. 

This  differs  from  an  ordinary  linear  program.  Instead  of  coefficients  in  each 
column  being  known,  the  column  P^  may  be  freely  drawn  from  a  convex  set,  Cj  . 
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♦0(x)  •  Z(Wn) 

where  ^(x)  are  continuoua  oonrex  function*  of  XcC  .  Let  u*  tiran  an 

x-x°  ia  known  each  that  ^(x0)  <  0  for  i  +  0  .  It  can  be  ahown  that  the 
generalized  program  on  the  following  page  la  equivalent. 
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FROBIAt:  Find  Mis  Z  aad  H,  Xt  ^  0  ,  ^  J  0  such  that 
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The  position  of  sn  Initial  basic-set  of  oolunna  la  indicated  by  heavy  dots. 

Tbs  associated  set  bf  slnplex  noltl pliers  are  denoted  by  ;  Initially, 

x  -  *°  where  *  0  for  1  i  0  and  *°  •  -e0(x°)  .  The  next  step  Is  to  font 

the  Iagranglaa 

e(x)-*0(x)a  Y  *iVx>  ♦  *o 

l-i 

f«r  «  •  «°  and  to  nlaialse  ♦(*)  for  x«C  .  Motion  that  the  lagrenglan  Is 
prsolsely  what  we  pet  If  we  were  to  "priot  ont"  tbs  general  oolun*  of  ooeffl- 
elents  of  tbs  warlabls  X  using  the  price  vector  to  for*  tbs  Inner  product. 
Thus,  we  wish  to  oboose  x  such  that  this  scalar  product  is  alalsna. 

lot  x  •  x°  be  tbs  wains  of  tbs  nLelaislng  x  .  If  d(x°)  •  0  ,  then 

x°  Is  an  optlnun  eolation.  If  #(x°)  <  0  ,  an  extra  eolana  Is  inserted  in  tbs 
generalised  prograa  with  coefficients  and  varl- 


able  .  Ae  problea,  restricted  to  those  variables  whose  coltsmu  bava 

known  coefficients,  is  then  optimized  using  the  simplex  method.  This  gives 

rise  to  a  new  set  of  simplex  multipliers  *  ■  *'  .  This  gives  rise  to  a  arv 

2  1 
solution  x  -  x  ,  etc.  At  any  stage,  the  approximate  solution  is  x  ■  fc^x 

using  for  those  which  solve  the  problem  restricted  to  those  varl- 

bles  with  known  coefficients. 

Let  us  turn  to  a  problem  in  control  theory.  The  application  of  mathe¬ 
matical  programming  methods  to  solve  control  problems  has  been  studied  by 
Zadeh  and  Whalen,  by  Ben  rosen,  and  others.  I  would  like  to  oonflnm  myself, 
however,  to  Linear  Control  Theory  as  described  by  Vontryagla,  Botlyeaakl,  Oam- 
krelldge,  and  Mischenko  in  Chapter  III  of  their  book  on  this  subject. 

WS  consider  an  "object "  defined  by  its  n  ♦  1  coordinates  x  * 

(^0  *  , ...jtg)  whose  "motion",  described  as  a  function  of  a  "time"  para¬ 

meter  t  ,  can  be  written  as  a  linear  system  of  differential  equations 

(1)  K  *  **  ♦  ** 

where  u  -  (u^  ,  u^  ,...,u^)  is  *  control  vector  that  must  be  chosen  tor  each 
t  from  a  convex  compact  set  U(t)  .  The  Initial  conditions  at  t»0  are 

*°  •  (o  .  .  (M»4>  • 

The  terminal  conditions  at  t  •  T  is  obtained  by  setting 

(2)  xT  -  X  T  ♦  »0 
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where 

ST-  (0,  il  ,  ,  («*•*) 

Eq  •  (i,o,o, . . . ,o) 

and  by  requiring  that  u  ■  u(t)  to  be  chosen  such  that 

(3)  Z  it  alnlaua 

As  given  In  Chapter  III  of  the  book  "Matheaatlcal  Theory  of  Opt  Inal  Con¬ 
trol  Processes"  by  Pontryagin,  Boltyashii,  Qeakrelidge,  Mischenko,  the  final 
state  my  be  written  in  the  fora 

00  -ZEq  ♦  JT  P^tB  u(t)dt  -  b 

0 

TO  tA 

where  b  •  X  -  P^f  is  a  known  vector,  and  Pt  •  e  Matrix  that  say  be  coo- 

leniently  eoaputed  as  a  function  of  t  .  Por  exaaple,  for  the  case  of  real  dis¬ 
tinct  characteristic  roots  ^  of  1  : 

a 

(5)  Pt  -  eU 

0 

where  are  square  an  trices  independent  of  t  .  The  latter  fonaaia  for  the 

la  developed  in  "An  introduction  to  the  Application  of  Dynaalc  Programing 

to  linear  Control  Systens"  by  P.  T.  aalth  in  BAUD  Report  RN-3526-FR,  February 

1963. 
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We  my  fornally  write  (4)  as  a  gene  rail  ted  linear  program. 

PROBLEM:  Find  Mia  Z,  u  £  0  such  that 

(6)  -ZE0  +  Tu-h 

n  ■  1 

where  I  my  be  freely  chosen  fro*  the  convex  set  defined  by 

rT 

(7)  Y*  /  P^BuCtJdt 

0 

for  all  possible  choices  of  u(t)  c  V(t)  . 

The  nethod  for  solving  the  generalised  linear  progrea  described  earlier 
for  convex  programing  can  be  applied.  Far  brevity,  we  onlt  the  question  cf 
how  to  obtain  the  initialising  basic  set,  except  to  say  it  Is  the  analog  of  the 
phase  I  procedure  of  the  ordinary  slnplex  nethod. 

I r 

As  soon  as  *  «  *  Is  determined  for  Iteration  k  we  seek  a  solution  of 
the  sub-problen  such  that  the  inner  product 

/T 

Py_tBu(t)dt 

0 

•  JT  [mnO^^iuftJJdt  for»(t)  t'U(t)  . 

o 

It  is  important  to  note  that  for  each  t  ,  **1^8  is  sane  known  vector  ct  . 

Thus,  for  each  t  we  oust  solve: 

3UB-PR0BXJM:  Find  Min  oSi  f or  u  «  U(t)  . 

If  U(t)  Is  a  polyhedral  set,  the  suh-problsn  Is  singly  a  linear  prograa.  If 
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for  all  t 


U(t)  is  the  ease  for  all  t  ,  then  the  linear  program  are  the  Mas 
except  for  the  varying  objective  forma  e*u  .  Interesting  enough,  the  sub -pro¬ 
grams  turn  out  to  be  the  same  as  vhat  Pontryagin  obtains  using  his  prin¬ 

ciple. 

A  control  problem  is  an  example  of  a  dynamic  system  which,  is  t  is  treated 
by  the  straightforward  procedure  of  discretizing  the  time,  would  lead  to  a  large  - 
scale  computational  problem.  The  generalized  linear  program  (or  decomposition 
principle  approach),  however,  provides  us  with  a  procedure  which  does  not  re¬ 
quire  the  discretizing  of  the  time  interval. 

The  last  twenty  years  have  been  marked  by  the  accelerated  trend  toward  auto¬ 
mation.  Many  believe  that  not  only  simple  control  processes,  but  soon  the  more 
complex  control  processess  will  be  mechanized.  If  so,  whether  we  like  it  or  not, 

decisions  will  be  made  for  us  by  machines.  Whether  or  not  they  will  be  good  de- 

/ 

cislons  will  depend  on  how  cleverly  we  have  instructed  the  machines.  This  ia 
turn  will  depend  heavily  on  how  clever  we  heve  been  in  developing  solution  tech¬ 
niques  for  solving  large-scale  systems. 

To  this  end,  we  have  sketched  several  ideas:  (l)  taking  advantage  of  the 
low  density  of  the  non-zero  coefficients  in  the  original  matrix,  (2)  finding  a 
compact  inverse  representation  of  the  basis  using  the  simplex  method,  And  (3) 
making  use  of  the  generalized  linear  program  or  decomposition  principle  approach. 
We  illustrated  the  latter  on  a  linear- control  problem  and  found  that  it  led  to 
the  maximal  principle  with  the  added  bonus,  however,  that  it  can  be  used  to  con¬ 
structively  converge  to  an  optimal  solution. 
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LINEAR  CONTROL  PROCESSES  AND  MATHEMATICAL 
PROGRAMMING* 

GEORGE  B.  DANTZIGf 

Linear  control  process  defined  [8],  [14].  We  shell  consider  an  “object” 
defined  by  its  n  +  1  coordinates  X  *»  (ar# ,  Xi ,  •  •  •  ,  *»),  whose  “motion” 
described  as  a  function  of  a  parameter,  “time”  (t),  can  be  written  as  a 
linear  system  of  differential  equations 

(1)  ~  =  A'X  +  flV 

I  t  9 

where  A  ,  B  are  known  matrices  that  may  depend  on  i  and 

tt  -  (til  ,  is* ,  •  •  • ,  ti.) 

is  a  control  vector  that  must  be  chosen  from  a  convex  set,  u  6  U(t)  for 
every  0  £  t  2£  T.  The  time  period  0  |  1  S  T  is  fixed  and  known  b  ad¬ 
vance.  The  coordinate  Xo  -  xo(t)  represents  the  “cost”  of  moving  the 
object  from  its  initial  position  to  zo (t).  For  this  purpose  it  may  be  assumed 
that  io(0)  *»  0.  Defining 

(2)  X  m  (0,  xx  ,*»,•••  ,  z»), 

the  object  is  required  to  start  somewhere  in  a  convex  domain  X(0)  c.  <S#  and 
to  terminate  at  t  =  T  somewhere  on  another  convex  domain  £(  T)  6  S?  , 
Problem.  Find  u  €  U(t)  and  boundary  values  Jf'(O)  €  So,  £(T)  €  Sf , 
such  that  Zo(  T)  is  minimized. 

Assuming  u  €  U(t)  is  known,  the  system  of  differential  equations  can  be 
integrated  to  yield  an  expression  for  X(T)  in  terms  of  -JT(O)  and  u  fc  U(t). 
This  is  true  in  general  but  will  be  illustrated  for  the  case  when  A *  and  B‘  do 
not  depend  on  t;  in  this  case 

(3)  X(T)  =  eTAX(0)  +  [*  e(T',)4Bu(t)  dt, 

Jo 

where  u(t)  6  U(t)  is  a  convex  set  and  where  we  assume  the  integral  exists 
whatever  be  the  choice  of  the  u(f)  €  U(t)  for  0  2s  t  £  T. 

Generalized  linear  program  [2],  Our  general  objective  is  to  illustrate 

*  Received  by  the  editors  January  12,  1966,  and  in  revised  form  Marob  25, 1965. 
Presented  at  the  First  International  Conference  on  Programming  and  Control,  held 
at  the  United  States  Air  Force  Academy,  Colorado,  April  15,  1965. 

t  Operations  Research  Center,  University  of  California,  Berkeley,  California. 
This  research  has  been  partially  supported  by  the  National  Science  Foundation  under 
Grant  GP-2633  with  the  University  of  California. 
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how  mathematical  programming  and,  in  particular,  how  the  decomposition 
principle  in  the  form  of  the  generalized  linear  program  can  be  applied  to 
this  class  of  problems.  An  elegant  constructive  theory  emerges,  [10],  [11], 
[12],  [13]. 

A  generalized  linear  program  differs  from  a  standard  linear  program  in 
that  the  vector  of  coefficients,  say  P,  associated  with  any  variable  m  need 
not  be  constant  but  can  be  selected  from  a  convex  set  C.  For  example: 
Problem.  Find  max  X,  m  £  0  such  that 

(4)  C/oX  +  Pyt  -  Qt ,  m  -  1, 

where  t/0 ,  Qq  are  specified  vectors  and  P  €  C  convex. 

It  is  assumed  that  the  elements  of  C  are  only  known  implicitly  (for 
example,  as  some  solution  to  a  linear  program)  but  that  particular  choices 
of  P  can  be  easily  obtained  which  minimize  any  given  linear  form  in  the 
components  of  P. 

The  method  of  solution  assumes  we  have  initially  on  hand  m  particular 
choices  Pi  €  C  with  the  property  that 

(7oX  +  Pin i  +  PtM*  +  •  •  •  +  P*m»  m  Qo , 

(3) 

Ml  +  «+•’•+  Mm  “  1, 

has  a  uniqu"  “feasible”  solution;  that  is  to  say,  X  -  X8,  n<  -  m<°  £  0  and 
the  matrix 


(6) 


B° 


Uo  Pi 
0  1 


is  nonsingular  (i.e.,  the  columns  of  P°  form  a  basis).  Because  P<  €  C,  the 
vector  P°  «  Pm  constitutes  a  solution  P  -  P°  for  (4)  except  that 
X  -  X°  may  not  yield  the  maximal  X. 

To  test  whether  or  not  P°  is  an  optimal  solution  one  determines  a  row 
vector  f  =  r  such  that 


(7) 


(1,0,  --,0), 


and  then  a  value  a  and  a  vector  Pm+i  €  C  such  that 

(8)  a  -  rPm+i  -  min  wP, 

nc 

where  we  denote 

(9)  P-[f]. 

If  it  turns  out  that  5  =  0,  then  P  —  P°  is  an  optimal  solution. 

1  This  is  not  a  restrictive  assumption  since  there  is  an  analogous  method  for  ob¬ 
taining  such  n  starling  solution,  see  (2). 
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If  P°  is  not  optimal,  system  (5)  is  augmented  by  Pm+i .  After  one  or 
several  iterations  k  the  augmented  system  takes  the  form  of  a  linear  pro¬ 
gram  : 

Problem.  Find  max  X,  n <  £  0, 

m+k  m+k 

(10)  Udk  +  J^PiUi  «  Qo,  =  i. 

1  1 

Letting  F*  denote  the  basis  associated  with  an  optimal  basic  feasible  solu¬ 
tion  ^  =  m.*  to  (10),  x*  is  defined  analogous  to  (7)  and  $*+1  and  Pm+*+ 1 
analogous  to  (8).  If  it  turns  out  that  S  =  0,  the  solution 

m+k 

(in  .  /**  -  Z  f .».* 

1 

is  optimal.  If  not  the  system  is  augmented  by  Pm+k+i  and  the  iterative 
process  is  repeated. 

It  is  known  under  certain  general  conditions,  such  as  C  bounded  and  the 
initial  solution  nondegenerate  (i.e.,  m.°  >  0),  that  r  -♦  f*  and  P*  — ► F*  on 
some  subsequence  k  and  that  P  =  P*  is  optimal.  The  two  fundamental 
properties  of  f*  are 

(12)  7r*  0  and  f*P  r*P*  *  0  for  all  P  6  C. 

The  entire  process  can  be  considered  as  constructive  providing  it  is  not 
difficult  to  compute  the  various  P„+*+ t  from  (8)  with  i  -  rm+i.  For  ex¬ 
ample,  if  C  is  a  parallelepiped  or  more  generally  a  convex  polyhedral  set, 
then  min  fP  constitutes  the  minimization  of  a  linear  form  with  known ' 
coefficients  t  -  fm+k  subject  to  linear  inequality  constraints  in  the  un¬ 
known  components  of  P,  i.e.,  a  linear  program.  In  this  case  the  iterative 
process  terminates  in  a  finite  number  of  steps  and  Pm+k  constitute  ex¬ 
treme  solutions  from  it.  In  all  cases  an  estimate  is  available  on  how  close 
the  Pth  solution  is  to  au  optimal  value  of  X. 

Application  of  the  generalized  program  to  die  linear  control  process.  Let 

us  denote 

(13)  P  -  f  r lv' ABu(t)  dt, 

and  note  that  P  is  an  element  of  a  convex  set  C,  generated  by  choosing 
all  possible  u(<)  €  (/(<)•  We  specify  that  (/*  -  (1,  0,  •  •  ,  0),  and  denote 
by  X  «  -Xt (T),  where  X$(T)  is  the  coordinate  of  X(T)  to  be  minimized. 
Then 


(14) 


X(T)  -  -U*  +  X(T). 
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We  further  define  Qo  by 

(15)  X(T)  -eTAX(0)+Qo. 

Substitution  of  these  into  (3)  formally  converts*  the  integrated  form  of 
the  control  problem  into  a  generalized  linear  program  (4). 

Each  cycle  of  the  iterative  process  yields  a  known  row  vector,  which  we 
partition 

(16)  *‘+I  «  [x,  9], 

where  r  represents  its  first  n  +  1  components  corresponding  to  P  and  9  its 
last  component  S'ix  c  t  is  known,  our  choice  for  Pm+*+)  is 

(17)  *Pm+k+i  =  mm  (  f \e(r-,)ABu(t »  it)  -  f  {  min  e(T~,)ABu(t)  j  dt, 

(Jo  ;  Jo  u(urt) 

where  clearly  the  minimum  is  obtained  when,  in  (17),  the  integrand  for 
each  t  is  selected  to  be  minimum. 

Note  that 

(18)  -  welT~t)AB 

is  a  row  vector  that  can  be  computed  for  each  t.  For  example,  $<,r  can  be 
represented  by  a  finite  sum  of  vectors  whose  weights  depend  on  t  and  the 
eigenvalues  of  A.  The  new  extremal  solution  P«+*+i  is  obtained  by  choosing 
the  control  which  minimizes  the  linear  form  in  u  for  each  t;  i.e.,  find 

(19)  min  u  6  U(t). 

For  example,  if  U(t)  is  a  polyhedral  set  then  (19)  is  a  linear  program.  If 
U(t)  is  the  same  for  all  t,  then  only  the  objective  form,  $,.ru,  varies  for 
different  f;  except  for  the  objective  form  the  linear  programs  are  the  same 
for  all  t. 

If  optimal  *-*  is  used,  then  the  optimal  control  «  (except  for  a  set  of 
measure  aero)  satisfies 

(20)  min  (**(<)«]'  «  6  U(t), 

where  ♦*(<)  -  r*e(r~,,4B.  Pon  try  agin  refers  to  this  as  the  maximal  prin¬ 
ciple.  It  is,  as  we  have  just  shown,  also  a  oonsequenoe  of  the  decomposition 
principle  of  linear  programming. 

Cooduston.  In  our  approach  the  general  control  obtained  for  each  cycle 
is  a  linear  combination  of  exactly  n  +  1  special  controls  obtained  by  mini* 

*  Actually  Q»  ie  not  Riven  hut  it  an  element  of  a  convex  set.  To  simplify  the  dis* 
cuttiott  which  follows  we  assume  Q»  it  a  fixed  vector. 
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mixing  for  each  t,  the  linear  expression  (19)  in  u  for  »  +  1  cnoices  of  r. 
These  special  control-  may  he  referred  to  as  extreme  control  t.  The  latter 
each  in  themselves  do  not  maintain  feasibility,  that  is  to  say,  guarantee 
that  the  object  will  move  from  J?(0)  to  X(T).  Each  new  linear  combina¬ 
tion  of  these  special  controls  will,  however,  generate  a  new  feasible  control 
with  a  lower  value*  for  the  total  cost  Xo(T).  Undei  the  conditions  stated 
this  iterative  process  is  known  to  converge. 
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ALL  SHORTEST  ROUTES  IN  A  GRAPH 


by 

George  B.  Dantzig 

A  shortest  route  is  sought  between  every  pair  of  nodes  (i,  j) 

in  a  graph  when  directed  arc  distances  are  given,  where  the 

values  of  a^j  nay  be  positive,  negative,  or  zero  except  a^  *  0  . 

If  the  graph  is  incomplete  so  that  an  arc  (i,  J)  is  missing,  the 

value  of  a, .  *  •  This  problem  (as  is  well  known)  includes  the 

travelling  salesman  problem  since  the  route  for  (i>i)  is  a  cycle 

and  one  can  solve  a  travelling  salesman  problem  with  distances 

d^  >  0  by  finding  a  minimum  cycle  in  a  graph  [a^j  «=  d^j  -  K] 

where  K  >  £.  £.  d. .  .  Our  objective,  therefore  is  more  modest, 

1  J 

it  is  to  find  a  negative  cycle  in  a  graph  if  one  exists,  if  none 
then  to  find  all  the  shortest  routes. 

The  procedure  is  inductive  and  was  stimulated  by  a  remark  of 

# 

Ralph  Oooory's  that  an  inductive  approach  was  probably  as  efficient 
as  any  other.  It  is  not  certain,  however,  whether  this  procedure 
has  appeared  elsewhere  in  the  literature  and  so  is  presented  here. 

It  is  shown  that  n(n-l)  additions  and  an  equal  number  of  comparisons 
are  required  to  solve  an  n  node  problem.  This  number  can  be 
reduced  to  n(n-l)(n-2)  if  negative  cycles  are  known  not  to  exist. 

This  method  is  therefore  as  efficient  as  the  best  result  known, 
that  of  Murchland  (3]  . 


1 


It  is  similar  to  many  proposed  schemes  in  that  entries  in  the 

matrix  are  replaced  by  a^  +  a^  if  the  latter  sum  is  smaller 
for  some  choice  of  k  .  After  replacement  the  nev  matrix  is 
operated  upon  in  the  same  way  until  no  improvement  cam  be  found.  Ihe 
various  methods  differ  only  in  the  rules  for  scanning  the  various 
(i,  J)  and  k  .  In  order  to  keep  track  of  the  routes  as  well  as 
their  values,  it  is  also  necessary  to  record  for  each  (i.J)  either  the 
first  arc  of  the  minimum  route  from  i  to  j  or  the  last  arc.  With 
this  information  it  is  easy  to  generate  all  the  arcs  along  the 
route.  Aside  from  the  efficiency,  the  second  advantage  of  the 
method  is  the  simplicity  of  the  proof  of  it, a  validity. 

Assume  for  nodes  1,  2,  .  .  .  ,  k  -  1  that  optimal  distances 
a^  sure  given,  we  wish  to  determine  optimal  distances  a*^  for 
nodes  1,  2,  .  .  .  .  ,  k.  We  shall  show  that 

For  l  m  1,  .  .  .  ,  (k-i) 


(1) 

\'i  •  Mf  (\j  *  ;Ji! 

(2) 

"Ik  ■  Kf-  (\»  *  "i! 

(3) 

si  ■  Hf 10  "kj  ■  • 

(i  - 1,. 

. k-1)  and  (J  »  1, . . 

oo 

**j  * Kln  [  *'ij  ■  "Ik 

The  inductive  procedure  begins  with  a^  ■  0  and  stops  if  at  any 
tine  a  diagonal  value  a*^  <  0  appears  in  which  case  a  negative 


2 


cycle  has  been  obtained;  or  if  step  k.  ■  n  has  been  coo^leted. 

Proof:  (l)  states  that  a  minimus  route  from  k  to  i,  starts 
with  some  arc  followed  by  a  mininuo  route  from  j  to  X  that 

does  not  go  through  k  .  Hence  the  minimum  of  these  alternative 
routes  is  the  one  desired. 

Formula  (2)  is  the  same  idea  except  the  alternative  routes 


are  defined  by  the  last  arc 


route  from 


> 


cf  the  route  and  the  beet 


to  J  that  does  not  go  through  k 


Formula  (3)  states  that  either  =  0  is  the  best  route 
from  k  to  k  or  there  is  a  negative  cycle  consisting  of  going 
along  some  best  route  from  k  to  l  and  then  i  back  to  k  . 

Formula  (4)  states  that  either  the  best  route  from  (i,  j) 
does  not  go  through  k  (and  has  value  a  )  or  does  go  through  k 


i  y 


(and  has  value  a^  4-  a*(  ) 

The  count  on  additions  Is 


C  -  Z  f(k  - l)(k-2)  +  (k-l)(k-2)  4-  (k-1)  ♦(k-ir] 

k  4-  1 

where  the  four  terms  are  the  count  of  (1),  (2),  (3),  (4) 
respectively.  Rote  that  we  omitted  from  the  count  (for  example) 
the  addition  a^4-  j  because  is  known  to  be 

xero.  In  the  case  that  negative  cycles  are  known  not  to  exist,  the 
third  term  may  be  dropped  and  the  laat  term  reduced  by  (k-1)  since 
the  diagonal  a*^  ■  0  In  the  latter  case,  the  count  is 
C  «  n  (n-l)(n-2)  additions  and  an  equal  number  of 


comparison# . 
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ALL  SHORTEST  ROUTES  FROM  A  FIXED  ORIGIN  IN  A  GRAPH 

by 

G.  B.  Dantz-tg*,  W.  Blattner**  and  M.  R.  Rao** 

A  shortest  route  is  sought  between  a  fixed  origin  node  i  *=  0 

to  n  other  nodes  in  a  graph  when  directed  arc  distances  are 

given  and  the  values  of  may  be  positive,  negative,  or  zero 

i  /  j  .  No  values  c  are  specified  unless  there  is  an  arc  from 

i  to  J  •  This  problem  (as  is  well  known)  includes  the 

travelling  salesman  problem  with  distances  d^  >  0  because  one  can 

set  (c. .  =  d. ,  -  K]  where  Yi  >  Z,  Z.  d. .  and  look  for  a  minimum 
XJ  lj  i  j  ij 

route  from  0  back  to  itself.  Iherefore  our  objective  will  be  more 
modest :  To  find  a  negative  cycle  in  a  graph  if  one  exists  or  if  none 
exists  then  to  find  all  the  shortest  paths  from  the  origin. 

The  method  is  inductive.  On  step  k  ,  there  is  a  set  S^ 
consisting  of  the  origin  and  k  -  1  other  nodes.  Restricting  arcs 
to  those  that  belong  to  the  subgraph  of  S^  ,  the  minimum  distances 
from  the  origin  along  these  arcs  to  nodes  i  e  are  assumed 
known  and  have  value  .  It  is  also  assumed  that  no  negative 
cycles  exist  in  the  subgraph  of  S^  •  It  follows  that 

(l)  ni  +  c^j  >  IIj  for  all  i  «  Sjj.  ,  J  €  . 


*  Stanford  University 

**  U.  S.  Steel 
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Theorem  1:  Let  denote  the  length  of  the  shortest  route  from  i 

to  j  along  arcs  of  the  subgraph  of  containing  no  negative 
cycles  and  let  (l)  hold,  then 

(2)  Dy  >  Hj  - 

Proof:  Let  the  sequence  (i  ;  i^  ,  ;  j)  denote  the  nodes 

along  a  minimum  route  from  i  to  J  in  ,  then  by  (l)  , 


ni  +  —  ^i  *  +  ci  i  —  ^i  * 

1  1  1  1  12  2 


'  \  +  CixJ  -  "j  ' 


Adding  these  inequalities  together  yields  the  desired  relation. 

Assuming  now  that  we  know  the  minimal  distances  for  , 

we  wish  to  augment  by  Including  a  node  q  jt  .  We  denote 
Sk+1  *  an<*  wl8^  t0  determljle  minimal  distances  II*  from 

the  origin  along  arcs  of  the  subgraph  of  S^+1  to  nodes 
1  €  S^+1  .  The  theorem  below  permits  us  to  determine  II* 
immediately. 

Theorem  2:  Let  q  jt  ,  and  «  js^,  ,  qj  then  a  shortest 

route  from  0  to  q  in  3,  has  as  last  arc  of  the  route 
-  —  -i  —  k+1 - 

(p  >  <l)  where  p  e  satisfies 

(3)  np  ♦  =M  -  Min  g  (Ht  ♦  y 

k 

and  H*  =  H_  +  c  is  the  minimum  distance  from  the  origin  to  o 
-  q  p  pq  - “ - 
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Proof'  Suppose  false  and  a  shorter  route  is  via  p  €  ,  then 


H-  +  c-  <  n  +  c 
p  pq  p  pq 


contradicting  (3)  .  This  theorem  is  true  even  if  S^  has  negative 

cycles.  The  II*  and  31^  would  then  represent  the  shortest 
distance  without  cycles  from  the  origin. 

Knowing  H*  ,  Theorem  (k)  below  may  now  be  applied  to 
determine  for  another  node  X  e  ,  its  minimum  distance  JI£ 
from  the  origin  along  arcs  of  the  subgraph  of  .  Knowing  II* 

and  H*  we  reapply  Theorem  (4)  again  and  again,  each  time 
finding  a  least  distance  for  another  node  in  S^^  .  Ibis  is  done 
until  all  nodes  are  exhausted  in  or  the  optimality  condition 

5  >  0  of  Theorem  3  below  is  satisfied  in  which  case  the 

remaining  1^  values  are  also  optimal  for  ,  or  the  negative 

cycle  condition  of  Theorem  5  is  satisfied. 

Theorem  3:  Let  T  be  any  subset  of  nodes  1  whose  minimum  distance 


H*  from  the  origin  along  routes  in  the  subgraph  of  Sk+1  is  known, 
let  q  c  T;  let  S  and  T  contain  no  negative  cycles;  let 


6ij  3  ni  +  ciJ  “  nj  i  c  T  ,  J  /  T 


then,  if 


(5)  >0 


for  all  i  c  T  ,  J  /f  T 


the  minimum  distance  for  all  remaining  nodes  is 


3 


This  theorem  Is  true  even  if  T  contains  negative  cycles  but  requires  a 
different  proof. 

Proof;  The  conditions  for  optimality  in  analogous  to  (l)  are: 

(7)  -  nj  +  cy  -  Bj  >  0  1  «  T  ,  j  t  T 

"i  +  ciJ  ■  nj  -  0  1  i  7  .  J  t  * 

”i  +  clj  •  "*  i  0  i  €  T  ,  J  €  T 

"i +  CU  -nV-0  1  * T  -  •> €  T 


The  first  of  these  holds  by  hypothesis  (5),  the  second  by  (l),  the 
third  by  hypothesis  that  the  T  set  is  optimal  in  (and  there 

are  no  negative  cycles  in  T);  finally  the  fourth  because 
II*  <  llj  and  (l)  holds. 


On  the  other  hand  if  the  optimality  conditions  6  .  >  0  of 
Theorem  3  does  not  hold  for  all  i  €  T,  J  /  T  ,  then  6. .  ■  Min  5.  .<  0 

tjL  1J 

holds  for  some  t  €  T  and  i  jl  T.  It  will  be  shown  in  Theorem  4, 
that  the  minimum  distance  from  the  origin  along  arcs  of  th?  subgraph 
to  node  X  is  given  by  +  6^  Thur  Theorem  4 

may  be  reapplied  until  there  are  no  longer  any  nodes  In  not  in 

T  or  condition  (3)  holds,  or  a  negative  cycle  is  detected,  but  we 
will  speak  more  about  this  later  in  Theorem  3. 


Theorem  4;  Let  and  T  contain  no  negative  cycles  where  T  is 

any  subset  of  nodes  1  whose  minimum  distances  from  the  origin  in 
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Is  II*  .  If  for  some  t  €  T,  l/t  T 

(8)  8t  -  Min  6^  <  0  1  €  T,  J  /£  T 


(9)  nj.ni+  + 


Is  the  minimal  distance  from  the  origin  along  arcs  in  the  subgraph  of 


sk+1  is. node  l  *x^ 

Proof :  On  the  contrary,  if  there  is  a  shorter  route  to  t  ,  then  this 
route  must  include  the  node  q  and  perhaps  some  other  nodes  of  T 
(otherwise  would  be  minimum  but  we  know  II|  <  1^  by  (8)  and 
(9)<  Along  this  shorter  route  let  (t  ,  Z)  be  the  last  arc  such 
that  t  c  T  ,  Z  /t  T.  Then  the  distance  along  the  route  from  J 
to  X,  may  be  denoted  by  (see  Theorem  l)  because 

the  nodes  from  £  to  JL  are  all  elements  of  3^  .  By  Theorem  (1) 


(10)  nj 


On  the  other  hand  by  virtue  of  the  assumed  shorter  route  through 


t  >  l 


(n)  +  c£l  +  ♦  cu 


'This  theorem  also  holds  it  T  contains  negative  cycles  and  H* 
are  the  shortest  distances  from  the  origin  along  routes  without  1 
cycles . 
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Subtracting  (10)  from  (ll)  and  rearranging 

nt  +  ctl  "  nl  <nt  +  ct!  ’  ni 

or  <  by  (4)  which  contradicts  hypothesis  (8)  of 

Theorem  4. 

Theorem  5t  If  S^,  T  contain  no  negative  cycles  and  the  shortest 

<  nt  and  T  is 

augmented  to  T*  =  |  T,^j  where  J)  is  as  defined  in  Theorem  4,  then  a 
a  necessary  and  sufficient  condition  that  T*  contain  a  negative 
cycle  is 

(12)  mf  +  c„  -  n*  &„  <o 

l  JL*  *  M 

Proof:  Since  H*  <  II ^  holds  the  optimal  route  from  the  origin  to  JL 
in  passes  through  q  .  If  (12)  holds,  then  the  cycle 

consisting  of  the  optimal  route  from  q  to  Jl  and  then  arc  (X ,  q  ) 
has  negative  length.  This  may  be  seen  by  summing  the  relations 
nI +  cu  »  II*  along  the  route  from  q  toi  and  then  adding  it 
to  (12).  If,  on  the  other  hand,  (12)  does  not  hold,  then 
we  will  show  that  II*  +  c^  >11*  for  all  ic  T*,  J  c  T*  which 
implies  that  no  negative  cycle  in  T*  exists  (as  one  can  see  by 
summing  such  relations  over  the  arcs  of  a  cycle-) 

We  need  now  only  rule  out  for  some  i  and  J  £  q  that 

•  • 

II*  +  Cf  j  <  II*  .  This  would  mean  we  could  lower  the  value  oi  II* 

•  •  #  O  | 

by  making  i  the  node  that  precedes  J  along  the  optimal 

•  • 

route  instead  of  some  .  This  deletion  of  the  arc  (i1  J  )  from 


distance  from  the  origin  in  Sj^  for  i  e  T  is  II* 
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2) 

the  tree  '  of  optimal  routes  and  entering  the  arc  (i  J  )  Into  the 

•  • 

tree  either  would  provide  a  shorter  route  to  j  or  it  would  cause 

a  cycle  to  form  which  (by  an  earlier  argument)  is  negative. 

However  neither  is  possible  because  the  former  implies  a  shorter 

route  to  J  (because  H*  was  lowered)  while  the  latter  implies 
0  j 

O 

a  negative  cycle  not  involving  q  .  The  cycle  cannot  involve  q 
because  all  shortest  routes  i  c  T*  from  the  origin  pass  through  q 
and  there  are  no  directed  arcs  into  q  along  the  tree  of  optimal 
routes  in  T*  .  But  a  negative  cycle  in  is  contrary  to 

assumption. 

Thus  a  negative  cycle  will  always  be  found  if  there  is  one  by 
(12).  If  one  is  found  the  Inductive  process  terminates. 

The  following  theorem  due  to  M.  Sakarovitch  (verbal 
communication)  permits  one  to  find  the  minimal  distance  in  S^+1  to 
several  nodes  at  once. 

Theorem  6  (Sakarovitch):  Let  L  be  the  nodes  in  the  tree  of 
optimal  routes  in  which  are  successors^  of  j,  as  defined  in 
Theorem  4,  then 

(13)  H*  »  n4  +  btl  for  i  c  L  . 

2  V 

'Note:  If  there  are  no  negative  cycles  in  S.  and  T  in  Art 
there  is  a  tree  of  optimal  routes  to  i  e  T  branching  out 
from  the  origin;  also  the  added  arc  (t.,_L)  with  t  €  T,  Xfi  T  still 
yields  a  tree  of  shortest  routes  without  cycles  in  i  €  T*  . 

■^The  tree  of  optimal  routes  from  the  origin  forms  a  partially  ordered 
set.  The  "successors"  of  X  are  those  nodes  reached  through  X  • 
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Proof:  One  notes  first  that  the  distance  11^  +  6^  can  be  realised 
by  first  going  along  the  optimal  route  to  JL  and  then  along  the 
former  route  from  JL  to  i  £  L  .  Now  assume  on  the  contrary  that 
there  is  a  better  route  to  i  .  As  in  proof  of  Theorem  4,  let  tX 
be  the  last  arc  of  a  better  route  such  that  t  t  I  and  X  jt  T  , 
then  Jljr  +  c-^  +  .  Subtracting  D ^  >  It^-  11^  , 

yields  <  6^  contrary  to  (8)  . 

For  completeness  we  give  the  following  well  known  theorem,  [3]. 

Theorem  7:  If  c^  >  0  and  of  are  known  to  be  the 

minimal  distances  from  the  origin  for  the  k  nodes  of  using 

arcs  of  the  full  n-node  problem,  then  II  ■  H  +  c  is  the  minimal 

- e - ! -  q  p  pq  - 

distance  for  q  /  where 


<*>  "p  +  Cpq  =  Min  (lIi  +  CiJ}  ’  P  C  \ 

l€Sk 

Proof:  If  not,  then  q  is  reached  via  some  shorter  route  that 

has  nodes  in  common  with  (since  Includes  the  origin).  Let 

(t  ,  q)  be  the  last  arc  on  the  shorter  route  with  t  <  and 
q  k  3^  ,  then 

(15)  Hr  +  c +  (min  distance  q  to  q)  <  H  ♦  c _ 

t  tq  p  pq 

but  this  relation  contradicts  (14)  because  minimum  distance  from 
q  to  q  is  non-negative  when  c^j  >  0  . 

We  are  now  in  a  position  to  give  a  coint  on  the  number  of 
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additions.  Associated  with  each  set  of  additions  such  as  for  (lh) 
is  the  same  number  of  comparisons  (or  possibly  one  less).  In  the  case 
>  0  ,  the  same  sums  occur  in  and  for  the  same  (i,  J). 

Since  at  step  k+1  ve  do  not  need  to  consider  the  arcs  back  to  > 
the  total  additions  do  not  exceed  the  total  number  of  arcs.  We  will 
denote  this  total  by  A  .  The  procedure  is  to  sort  the  11^  + 
values  as  generated  from  low  to  high.  Let  the  lowest  sum  on  this 

list  be  nA  +  c^j  .  This  sum  on  the  list  is  deleted  if  II  ^ 

•  • 

has  previously  beer*  determined;  if  not  then  H  •  H  ♦  e.. 

J  * 

•  • 

Next  the  sums  II.  +  c .  ,  .  ,  .  . ,  t  .  .  \ 

3  j  k  are  computed  for  ail  arcs  (J,  k)  and 

•  f 

made  part  of  the  sorted  list.  The  process  is  then  repeated.  Sorting 
requires  effort,  however,  and  so  that  the  two  theorems  that  follow 
are  misleading. 

Theorem  8:  If  all  distances  c^  >  0  ,  then  the  number  of  additions 
using  formula  (14)  does  not  exceed  A  ,  the  number  of  arcs. 

Theorem  9:  The  number  of  additions  in  the  general  esse,  when 
formula  (3)  and  (8)  is  used  does  not  exceed 


(16)  A  ♦  nf  +  (n  -  1)  f.  +  .  .  .  f 
1  d  n 


where  n  ij  the  number  of  nodes,  f.  is  number  of  arcs  directed 
— - - - - '  k  - - - *  -  - - 

forward  from  the  k -th  node  to  enter  the  indue tion. 

This  suggests  preordering  from  low  to  high  the  nodes  by  the 
number  of  their  forward  arcs.  If  this  is  done,  the  bound  reduces  to 


(17) 


A  ♦  nfx  (n  -  1)  f2  ♦  ...fn  <  (n  ♦  3)  A Jz 
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Associated  with  each  arc  (i, j)  of  a  graph  are  two  numbers  the 

"cost"  and  t^  the  "time"  per  unit  flow.  In  our  application  the  unit  flow 
is  a  ship  making  one  trip  from  i  to  j  at  a  cost  c, ,  and  taking  t 

1J 

hours.  In  another  example,  a  vessel  for  hire  can  make  a  profit  p  .  each 
time  it  goes  from  i  to  J  j  eventually  (if  there  are  a  finite  number  of 
ports)  it  must  complete  a  cycle  with  a  total  profit  P  and  a  total  lapsed 
time  T  where  P  is  the  sum  of  the  profits  and  T  is  the  sum  of  the  times 
on  the  arcs  of  the  cycle.  For  a  maximum  rate  of  return,  the  shipowners  should 
use  that  cycle  which  maximizes  the  ratio  of  P/T  .  Later  we  shall  describe 
a  more  complex  linear  programming  model  which  we  solve  using  a  column 
generation  scheme  (a  variant  of  the  decomposition  principle).  The 
subproblem  turns  out  to  be  one  of  finding  a  cycle  in  a  graph  that  has  the 
minimum  ratio  of  total  cost  to  total  time. 


Consider  the  following  linear  program: 
Find  Min  z  ,  >  0  such  that 


(i) 


n 


Z  c 

i,3-l 


1J 


z 


n 


(2) 

2  ^11  1 
i,j-l  10 

‘us0 

(3) 

n  n 

2  x<<  “  2  x .  -  0 

i-1  10  k«l  J 

<1  “  1, 2, . . . ,  n 

♦Stanford  University 
**U.  S.  Steel 


theorem  1:  Associated  with  an  extreme  minimizing  solution  to  (1),  (2),  (3)  is 


a  cycle  whose  total  cost  to  time  ratio  is  minimum. 

Proof:  Let  *=  1  if  (i,j)  is  an  arc  of  some  cycle  and  =  0  otherwise. 
Let  E  t^  x^  =  T  ,  then  x^  =  T  satisfies  (2)  and  (3)  and  z  =  c/T  is 
the  ratio  of  total  costs,  C  =  2  I  c^  ,  to  total  time  T  .  Accordingly  we 
can  always  associate  with  a  cycle  one  of  the  solutions  of  (l),  (2),  (3)« 

Consider  now  the  class  of  minimizing  solutions  to  (l),  (2),  (3).  We  can  now 
see  that  to  an  extreme  minimizing  solution  corresponds  a  simple  cycle.  This 
follows  because  the  flows  x^.  >  0  can  be  represented  as  a  sum  of  simple 
circulations.  If  any  of  these  circulations  had  by  itself  a  lower  ratio 
^CiJXij/^tijXij  another  one,  the  solution  could  not  be  optimal.  Indeed 

an  improved  solution  could  be  obtained  by  building  up  the  circulation  around 
that  cycle  with  the  lowest  ratio  and  decreasing  the  flow  around  the  one  with 
a  higher  ratio.  Nor  could  a  solution  be  extreme  if  there  were  two  simple 
cycles  with  the  same  ratio  because  one  could  represent  such  a  solution  as  a 
convex  combination  of  two  others  by  first  building  up  and  then  building  down 
the  circulation  in  one  of  the  cycles  while  adjusting  the  other  so  (2)  holds. 


THE  SIMPLEX  ALGORITHM 

A  simple  algorithm  for  solving  (l),  (2),  (3)  can  be  derived  from  the 
simplex  method.  A  basis  involves  n  columns  (one  equation  is  redundant). 

The  corresponding  arcs  in  the  graph  must  consist  of  a  tree  and  one  out-of-tree 
arc.  To  see  this  we  note  that  since  a  basis  is  non-singular,  there  must 
be  at  least  one  non -singular  (n-l)x(n-l)  submatrix  formed  by  deleting  the 
row  associated  with  the  time  equation  and  deleting  some  column  of  the  basis. 
Non -singularity  implies  that  the  n-1  arcs  associated  with  the  remaining 
columns  form  a  tree.  The  arc  associated  with  the  variable  of  the  deleted 
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column  together  with  a  subset  of  the  arcs  of  the  tree  form  a  cycle. 

In  fact,  this  implies  that  every  basic  feasible  solution  to  (2)  and  (3) 
must  consist  of  one  tree  spanning  all  nodes  end  one  simple  cycle  formed  by 
a  subset  of  the  arcs  of  the  tree  augmented  by  one  additional  arc.  Ihls 
additional  arc  completes  the  cycle  making  possible  the  positive  flow  forced 


by  the  time  equation. 


variables  other  than  the  cycle  variables  have  zero  value  in  a  basic 
solution,  (b)  that  each  node  in  the  simple  cycle  has  one  cycle  arc  pointing 
into  and  the  other  away  from  it,  and  (c)  the  values  of  the  cycle  variables 
are  the  same  and  equal  to  l/T  where  T  is  the  total  time  around  the  cycle. 

It  is  also  easy  to  compute  the  simplex  multipliers  (prices)  associated 
with  the  basis.  Indeed  if  we  let  p  be  the  the  multiplier  associated  with 
the  time  equation  (2)  and  let  K .  be  those  associated  with  the  node 

u 

equations  (3)>  then  for  each  arc  (i,j)  associated  with  a  basic  variable 

nj  -  ”l  +  0  ■  C1J 

Summing  these  relations  for  all  arcs  (i, J)  c  Cycle  yields 

(5)  p  -  C/T  -  Z  c^  fz  t±J  (i,J)  €  cycle  arcs 
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Knowing  p  ,  one  nay  arbitrarily  choose  the  value  of  any  one  node  (there 
la  a  redundant  equation)  and  determine  the  remainder  by 

(^)  ITj  "  I^i  +  (c^j  -  p  t^)  (i,,))  €  tree  arcs 

by  branching  out  from  the  selected  node  along  arcs  (i, J )  of  the  tree 
until  all  nodes  are  reached. 

To  obtain  an  improved  solution  the  simplex  multipliers  are  used  to 
eliminate  the  basic  variables  from  the  cost  equation*  The  resulting 
coefficients  for  the  non -basic  variables  are 


(7) 


Sij  “  (cu  ” p  tip  “  ^  nj "  V 


If  all  >  0  ,  then  the  value  of  p  given  by  (5)  is  the  minimum  cost 
to  time  ratio  and  the  problem  is  solved. 

If  not,  let 


(8)  Bpq  <  0  for  SOme 

We  now  make  a  special  Inductive  Assumption:  at  each  iteration,  there  is  a 
basic  feasible  solution  consisting  of  a  directed  tree  spanning  out  from  a 
single  node  that  is  its  root,  augmented  by  one  additional  arc  to  form  one 
simple  cycle.* 


t  If  a  feasible  cycle  exists  in  the  graph  we  can  satisfy  the  inductive 
assumption  by  taking  any  node  of  the  cycle  as  the  root  of  the  tree  and 
spanning  out  from  the  nodes  of  the  cycle  using  forward  arcs  to  other 
nodes,  and  then  iteratively,  repeating  the  process  with  all  nodes  reached. 

If  nodes  still  remain  they  can  be  reached  by  introducing  high  cost  artificial 
arcs  as  required.  If  there  is  no  feasible  cycle  in  the  graph  there  is 
obviously  no  feasible  solution  to  the  problem;  if  such  is  indeed  the  case 
this  fact  will  be  discovered  by  the  algorithm  that  follows. 
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If  (p,q)  is  the  dotted  arc  marked  "IN  (Case  l)"  in  the  figure,  then  it  is 
easy  to  see  that  entering  (p>  q.)  into  the  set  of  basic  arcs  does  not  form 
a  new  cycle  and  ve  must  drop  out  of  the  basic  set  the  one  which  is  also 
directed  into  q  .  A  new  basic  feasible  set  of  arcs  is  obtained  with  the 
values  of  JI^  decreased  to  +  8^  for  node  i  =  q  and  all  nodes  i 
that  are  followers  of  q  in  the  new  tree.  All  other  remain 
unchanged.  In  Cases  II  and  III  a  new  cycle  is  formed  and  we  must  drop 
out  of  the  basis  any  one  arc  (r,s)  which  is  in  the  old  cycle  and  not  in 
the  new  cycle. 

Theorem  2;  If  the  inductive  assumption  holds,  and  if  /(p,q)  is  entered 
into  the  basic  set  in  place  of  the  basic  arc  directed  into  q  ,  then  the 
inductive  assumption  holds  after  the  change  except  when  a  new  cycle  is 
formed. 

Theorem  3:  If  a  new  cycle  is  obtained  as  a  result  of  changes  in  the  basic 
set  of  arcs,  its  p*  -  c/t  ratio  is  less  than  the  previous  one. 

Theorem  4:  If  d^  »  (c^  -  p  t^j)  is  used  as  distances  on  arcs 
(i, J)  in  the  graph,  then  any  cycle  in  the  graph  whose  stun  of  distances 
around  the  arcs  of  the  cycle  is  negative  has  a  lover  C/T  ratio. 

The  simplex  method  accordingly  reduces  down  to  finding  a  negative  cycle 
in  a  graph  when  arc  distances  d^  are  given:  Starting  with  11^  -  0  for  some 
node  of  the  cycle,  the  other  11^  are  simply  the  distances  from  this  origin 
node  along  arcs  of  the  tree  to  node  i  .  The  simplex  iterative  process  Is 
seen  to  be  the  standard  one  for  determining  the  shortest  route  from  the 
origin  to  all  others,  terminating  when  either 
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(9) 


au  -  <"j  -  V  i 0 


for  all  1, J 


or  on  some  Iteration  a  negative  cycle  like  In  Cases  II  or  II  Is  found.  In 
the  former  case,  it  is  seen  by  sunning  (9)  for  all  (i,j)  around  any 
given  cycle  that  distance  around  any  cycle  Is  non-negative  so  that  the 
p  used  to  determine  d^  =  c^j  -  p  la  the  best  ratio.  In  the  latter 

case,  a  new  cycle  vlth  a  lover  value  of  p  is  obtained  and  new  d^ 
values  are  computed. 

ALGORITHM:  In  the  following  algorithm  the  pairs  (i,  j)  represent  directed 
arcs  defined  In  the  graph: 

Or  Let  Sq  be  any  starting  cycle.  If  none  available,  set 

p  «=  Max  [c../t.  J  in  Step  2  below 
•  a0 

1:  For  k  =  0,1,. .  . 

2:  Compute  pk  ■=  I  c^/zt^  (i,j)  €  SJc 

3:  Compute  <3*j  =  c^  -  p^j  for  all  1,  J 

*  4:  Set  «=  0  and  set  predecessor  of  node  1  as  * 

(meaning  none). 

Set  11^  -  «  and  set  predecessor  for  1  /  1  as  1. 

5:  For  each  1  ■  1,2, ...,n  form  6«-a£)+ni for 
J  »  1,2,  ...,n  j/i * 

(a)  If  5  >  0  z’eturn  to  (5)  and  continue  scanning  J  for 

fixed  1  and  then  repeat  Increasing  1  to  1  +  1  until 
1  =  n,  J  •  n  -  1.  If  i-n,  J-n-1  terminate. 

Cycle  S^  is  optimal. 

-ff These  are  devices  to  initiate  the  computation  without  effort  and  to 
construct  the  starting  directed  tree  necessary  to  satisfy  the  inductive 
assumption. 
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i 

i 

j 

(b)  If  8^  <  0  go  to  (6). 

6:  Determine  the  nodes  In  reverse  order  along  the  route 

R  from  the  origin  to  i  by  back  tracing  the  predecessors 
of  i  . 

(a)  If  J  is  not  a  predecessor  of  1,  then  (change 
predecessor  of  j  to  1  and  replace  value  of 
■S*  "5  *  6ij  ,  return  to  (5)  with  i  ■  1, 

i  =  2. 

(b)  If  j  is  a  predecessor  of  1  ,  then  let 

+  ^  be  the  cycle  along  the  route  R  from 
node  j  to  i  and  back  to  J  along  arc 
(i,J).  Return  to  1  increasing  k  to  k  +  1. 

for  programming  simplicity  the  above  algorithm  does  not  maintain  a 
directed  tree.  If  it  is  modified  to  do  so,  the  nodes  can  be  priced 
sequentially  along  the  tree  and  the  return  from  step  6  (a)  to  step  (5) 
modified  to  take  advantage  of  this. 


7 


COMPUTATIONAL  EXPERIENCE 


Set  I 

Problem 

Nodes 

Arcs 

Seconds  on  IBM  1620 
(including  input -output) 

A 

4 

12 

3.60 

B 

4 

12 

2.16 

C 

5 

20 

3.96 

D 

4 

7 

2.52 

E 

6 

13 

6.84 

Set  II 

(Excluding  input -output) 

P 

5 

20 

1.80 

G 

10 

90 

7.92 

H 

15 

210 

14.04 

I 

20 

380 

33.84 

J 

25 

600 

36.00 

K 

30 

870 

103.68 

For  problems 

in  Set  H,  the  t 

*J 

values  for  each  arc  were  randomly 

generated  Integers  between  10  and  6o. 

Similarly, 

the  c^  values  were 

randomly  generated  between  20  and  '•20 

• 

We  do  not  have  an  upper  bound  on  the  number  of  operations  except  the 
kind  that  one  could  derive  from  a  standard  proof  of  the  simplex  algorithm. 
In  another  paper  where  a  variant  of  the  scanning  procedure  given  here  la 
used  an  upper  bound  of  (Nodes  +  3)  (Arcs)  additions -comparison  operations 
Is  given  for  finding  a  negative  cycle,  (lj. 
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Application  to  «  Ship  Routing  Problem 

Amounts  arc  required  to  be  shipped  from  porta  1  to  ports  J  . 

There  are  n  ports  (nodes).  The  shipping  can  either  be  done  by  charter 
at  a  cost  v^  per  unit  shipped  or  by  using  one  of  a  fleet  of  m  vessels 
under  the  control  of  the  shipping  company.  If  vessel  k  is  used  the  amount 
that  It  carries  between  (1, J)  depends  on  the  kind  of  ship  and  on  the 
pattern  of  ports  forming  a  cycle  g  that  Is  assigned  to  the  ship.  We 
denote  this  by  .  Thus  If  arc  (l,j)  18  not  part  of  cycle  g  ,  then 

a^  *  0  and  if  It  Is  its  value  is  the  capacity  w^j  of  the  vessel.  tTf 


Material  balance  equations :  For  1,  J  «=  1,  2,  .  ..,& 

yiJ  +  £  =  1  g  *kg  X  kg  “  by 

where  y^  is  amount  chartered  and  is  the  number  of  times  that  ship 

k  Is  employed  in  the  g-th  type  cycle.  We  allow  x^  to  have  fractional 
values  which  we  Interpret  as  rate  of  use  of  the  ship  In  some  given  period  of 


time. 


Vessel  hour  constraints: 


(u) 


1|  2, ... ,m. 


where  h^  Is  the  total  hours  available  on  the  k-th  vessel,  la  the  unused 

hours  of  the  ship,  t^  la  the  time  to  complete  one  cycle  of  type  g. 

Objective  to  be  minimised: 

(12)  r  yMM  •  Z 


fft Dependence  on  (i,j)  is  possible  If  type  of  cargo  on  route  (i,j)  Is 
different  from  that  on  other  arcs.  In  caae  of  airplanes,  rapacity  depends 
on  distance. 
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Here  we  are  assuming  that  the  cost  to  operate  vessel  is  c^  per  unit  time 

used,  hence  there  is  a  savings  of  c,  per  hour  not  used. 

In  an  ore  shipment  application  which  we  were  interested  in  there  were 

too  many  possible  cycles  to  explicitly  list  all  the  coefficients  of  the 

problem.  Accordingly  we  decided  to  generate  the  column  of  coefficients 

\ 

as  needed.  Using  y^  and  as  basic  variables,  one  has  a  starting 
basic  feasible  solution.  We  now  assume  we  have  introduced  into  the  basis 


several  other  columns  and  we  have  a  set  of  simplex  multipliers  p 
associated  with  (10)  and  q^  with  (ll).  Me  wish  to  "price  out"  the 
column  associated  with  nnd  to  find  that  column  g  for  each  k 

that  prices  out  most  negative.  Hie  relative  c ">Et  coefficient  of 

becomes 


<15>  a 


pij  “kg 


(i,J)eg  ^  1J  1,3  iJ 


Tc  kg  (lfj)  kg  (lf3)tg  *  1J  ij' 

(Xu:  subproblem  becomes  one  of  choosing  that  cycle  in  the  network  of  ship  k 
for  which  (13)  is  a  minimum.  Since  a.  J  c  w  is  the  ship's  capacity,  if 

Kg  1J 

the  arc  (i, j)  is  used  In  the  cycle  g  and  zero  otherwise,  the  sum  in  (13) 
ia  simply  the  sum  of  the  ship  capacities  on  arcs  (i,d)  weighted  by  p^ 
and  the  times  weighted  by  around  the  cycle  g  .  Note  that  is  the 

sum  of  times  on  arcs  (i,J)  around  the  cycle.  Unfortunately  the  problem 
in  this  form  is  that  of  finding  a  most  negative  cycle  in  a  graph  whoae  are 


distances  arc  given.  Ihi3  class  of  problems  Includes  as  a  special  case 


the  difficult  travelling  salesman  problem. 


We  got  around  this  difficulty  by  a  change  of  units.  We  set 
»  *kg^kg'  *^e  relative  coat  coefflcienta  for  the  new  problem  become 

<14>  -  'ip«vVV 

where  g  denotes  the  (i,j)  «  cycle  g 

k  ' 

Since  for  fixed  k  la  constant  and  t^  «  g  t^  the  aubproblem 
becomes  one  of  finding  that  cycle  g*  that  minimizes  the  ratio 

<«>  (i  *«  C«>/(1 

which  fortunately,  as  ve  have  seen,  la  a  solve able  problem' 
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1.  Pivot  Transformations 
Th*  a  chama 


rj*  ..  n  •< 


b  . .  «-y# 


(1.1) 


c  ..  d  ..  *-y 


•r  ..  -t 


conveniently  exhibit*  two  ay* tern  of  linear  equations,  a 


row  system 


(l.lr) 


*  •  • 

. .  +  arj*  +  . .  ♦  bi)  ♦  . .  ■  -y* 


+  cq#  ♦  ..  ♦  dtj  + 


In  which  the  dependent  -y-labeli  (variables  or  numerical  values )  are  expressed 
as  linear  combinations  of  the  Independent  q-labela;  and  a 


column  system 


(1.1c) 


♦  x*a  <*■  ..  ♦  xc  ♦  ..  ■  l 


. .  ♦  x*b  ♦  . .  +  »4  ♦  ..  ■  t 


In  which  the  dependent  (-labels  are  expressed  as  linear  combinations  of  the 
independent  x-labels.  ^ 

A  pivot  transformation  with  pivot  entry  a  i  0  simultaneously  re- 
expresses  the  pair  of  linear  systems  by  solving  the  -y#-equatioo  of  the  row 
system  for  and  the  (‘-equation  of  the  column  system  for  x#  ,  and  then 
using  these  equations  to  eliminate  q*  and  **  as  Independent  labels  firam 
the  rsnalnlug  row  and  column  equations.  y#  and  |*  thereby  benosm 
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Independent  label*. 

or 


Solving  for  n*  the  row  system 
..  ♦  »j#  +  ..  +  a'Hrj  ♦  •  -a^y* 

. .  a"1/*  +  . .  +  a'Hjj  +  . .  ■  -tj#  , 


and  hence, by  substitution, 

..  c( •  •  -a'V*  -  ••  -a-1bTi  -  ..)  +  ..  +dn  ♦  .. 


or 

Solving  for 


•  •  -ca  y*  +  . •  ♦  (d«ca^b)jj+  .. 

x#  in  the  coiusn  o ye ten 

«  -1  •  -l 

..  +  x  +  ..  +  xca  +  ..  *  t  a 


or 


♦  tV1 


-  xca 


-y  . 


-y 


end  hence,  by  substitution, 

*  -1  .1 

a  a  ♦  (.  a  +  {  S  -  a  a  -  XCft”  -  .  .  ft)  +  .  .  .  +  Xd  ■  ( 


or 


♦  {VH  ♦  . .  ♦  x(d  -  ea-1b )  ♦  . . 


Therefore,  the  reeult  of  asking  a  pivot  transformation  with  pivot  entry  a  /  0 
can  be  sunaarlied  In  the  transformation  of  the  scheaa  (l.l)  Into  the  scheaa 
(1.2) 


(1.2) 


In  word*  the  schena  (1.2)  Is  obtained  frea  the  s chews  (1.1)  by 

1.  Kxdnngtag  labels  corresponding  to  the  pivot  entry  a;  naaely, 
replacing  if  with  y#,  and  -y*  with  -ij#  In  the  row  ay* tea, 
and  replacing  x*  with  {*,  and  {*  with  x*  In  the  ooluan 

systea. 


3. 


2.  Keeping  all  remaining  labels  (e.g. ,  tj,  -y;  x,  g)  unchanged. 

3.  Replacing  the  pivot  a  by  its  reciprocal  l/a;  replacing  each 
remaining  entry  in  the  pivot's  row  b  by  b/a;  replacing  each 
remaining  entry  in  the  pivot's  column  c  by  -c/a;  and  replacing 
each  other  entry  d  in  b'e  coluan  and  c's  row  by  (d  -  ~)  . 

Clearly  the  only  a’ssult  of  making  a  pivot  transformation  is  to  re- 
express  or  re-pzesent  a  pair  of  linear  systems  in  terms  of  dlffsrent  seta  of 
Independent  and  dependent  labels. 


Bcawple.  (*  denotes  the  pivot) 

\  'i2  n3  \ 


y2  \ 


5  l/a  1/2  -3  |-y3 


conveniently  exhibits  e  dual  pair  of  linear  programs  in  standard  form 


Minimise 


Column  (or  Dual) 
Maximise 


v  -  ClTl  ♦  ...  4  Vl  ♦  «  »  -  Vl  ♦  —  ♦  Vm  ♦  4 

constrained  by  coos trained  by 

*Uyl  +  *'•  *  Vl  *  bl  "  0  Vll  4  •••  ♦  Vm  ♦  ci  *  *i>° 

(2.1r,c) 


*Mlyl  + 


V«  *  ®M 


♦  b„  -  0 


*X*1» 


^  •  e  e  ♦ 


*  c»  *  "i 


k 


Theorem  1.  If  (v;  satisfies  the  row  equation*  and 

(u;  x^, ... ;Xj|)  satisfies  the  column  equation*  then 

(2.2)  x^  ♦  ...  +  XjXj  •  v  -  u  . 

Theorem  2.  If  (v° ;  y®, ...#y°)  satisfies  the  row  constraint*  (2.1r)  (is 
"row  feeaible")  and  (u1;  x^,...,x£)  aatiafiaa  the  column  con¬ 

straint*  (2.1c)  (is  "coluan  feasible")  then 

u°  <  nH  oiaup  v  <  v'1 

.  *  • 

and 

u  <  maximirc  u  <  v'  . 

*  m 

Oorollary  l.  If  feasible  (^°j  y^,...,y^)  (u1;  k[,. x£,...,x£) 

can  be  found  such  that  v°  »  u'  ,  then  they  constitute  (optimal)  solutions 
to  row  and  colimn  programs.  Ibis  can  happen  only  if  (by  (2.2))  x^  »  0 
for  all  1  ,  that  is,  only  if  -  0  and/or  y^  •  0  . 

3.  Reduction  to  Canonical  form. 

In  order  to  solve  a  dual  pair  of  linear  programs  (2.1)  which  are  in 
standard  form  the  first  task  is  to  eliminate  the  unrestricted  ^variables 
from  the  column  program,  and  re -express  the  row  program  in  tens  of  the 
smallest  possible  eat  of  Independent  y- variables.  This  is  accomplished  by 
using  the 

Redaction  Buie.  Hake  pivot  transformations  on  (2.1)  and  its  suc¬ 
cessive  representations  with  pivot  antriss  corresponding  to 
O-dependent  end  y-lndepeodent  row  program  labels  and  x -dependent 
end  k- independent  column  program  labels  until  no  longer  possible. 

When  such  a  pivot  entry  cannot  be  chosen  then  a  representation  he*  been 
obtained  whose  schema  is  such  that  either 

(a)  every  entry  which  correspond*  to  O-dependsnt  sod  y-indepen&ent 
row  program  labels  end  to  x -dependent  and  ^-independent  oolumn  program 
labels  is  e  aero  and  thus  cannot  be  chosen  as  a  pivot;  or 

(b)  no  row  program  dependant  labels  are  O'e  end  no  column 
program  independent  labels  are  k‘s  . 

If  (a)  occurs  we  must  have  obtained  e  schema  of  the  form 


(3.1) 


y;+i  ••• 

yo+n 

0  see 

0 

i 

e 

®ii 

e 

*in  | 

e  < 

Vn 

s 

•ii 

s 

H 

m 

• 

e  ■ 

aal 

e 

amn  [ 

0 

amn+l  *** 

e 

aaH . 

# 

^1 

e 

0  ess 

e 

0  • 

s 

aafl,n+l*“ 

e 

aiUlH 

• 

« 

• 

0  see 

*  1 

0  | 

e 

“lto+l  *•* 

•i* 

1 

see 

cn  ! 

cn+l 

h 

d* 

mxLi 

u 

-  0 


»  ▼ 


5. 


where  the  prlaed  var.ables  are  a  rvarrangaaent  of  the  original  variables  in 
(2.1)  and  the  prised  entries  are  determined  ty  the  succession  of  preceedlng 


pivot  steps.  If  any  of  the  entries  b*^, . . .  Is  different  froot  sero  than 
the  row  program  constraints  are  Incompatible,  for  so**  row  equation  corre¬ 
sponding  to  a  dependent  0- label  would  read  Ka  non- zero  quantity  equals  sero." 
Then  no  optimal  solutions  exist.  Otherwise,  If  all  b^,...^  *  0  ,  the 
row  equations  corresponding  to  dependent  0-labals  reed  "sero  equals  aero” 
end  can  thus  be  emitted.  Clearly  the  columns  corresponding  to  the  Independent 
0-labels  can  also  be  emitted  from  the  point  of  view  of  the  row  program.  These 
sane  rows  and  colons  may  be  emitted  from  the  point  of  view  of  the  column 
program:  the  equations  corresponding  to  the  unrestricted  \£, . .  oan  be 


put  aside  since  they  represent  no  constraints;  then,  the  coefficients  corre¬ 
sponding  to  the  Independent  *••»*£  art  **ro*  •a4  bano#  t**1*  rov* 

■ay  be  omitted.  Hotloe  that  the  argument  aede  from  the  column  program  point 
of  view  holds  whether  or  not  all  b^, .  . .  ,b£  are  sero;  however,  If 
of  these  axe  not  sero  then,  clearly,  even  if  there  exists  a  oolvam 
feasible  solution,  the  value  of  the  objective,  maximise  u  ,  oan  be  aede 
arbitrarily  large  (vtoyt). 

m  sunary  If  (a)  occurs  and  all  b^,...,!^  are  aaro  we  obtain 
a  smaller  representation  for  the  dual  pair  of  linear  programs  (2.1)  whose 
Is 


6. 


(3.2) 


(>  0) 


(>  0) 


y^l 


y '  1 

•'n+n 


»-v.' 


*-y' 


«v 


(<  o) 


(min) 


8uch  a  representation  la  referred  to  aa  being  in  canonical  fora:  all  variables 
(except  u,v)  are  restricted  to  be  nonnegative,  and  each  equation  expresses 
•cm  variable  as  a  linear  combination  of  others. 


Alternative  (b)  is  a  special  case  of  (a)  with  m  -  M  ,  i.e.,  all 
labels  are  dependent,  all  0-labels  are  independent.  Therefore,  for  either 
alternative,  we  can  state 

Theorem  3.  If  solutions  exist  to  (2.1)  then  a  representation  in  canonical 
fore  (3.2)  for  the  dual  pair  of  linear  progress*  can  be  obtained  by  the 
reduction  rule  specified  above. 


h.  The  Ifcln  Theorem  of  Linear  Programing 

Theorem  4.  (Main  Theorem)  Given  a  duel  pair  of  linear  prog rase  in  canonical 
fore  there  exist#  a  finite  succession  of  pivot  transformations  which  obtain 
a  representation  for  the  progress  whose  tehees  has  exactly  one  of  the 
following  four  forts  {#,  A  denote  nonnegative  and  nonpositive  entries, 
respectively,  and  Marginal  variables  have  been  omitted ) : 


(4.a) 


1 


exhibiting  optimal  solutions 
to  both  progress; 


f 
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1 

exhibiting  a  feasible  solution  to 
the  row  program  and  the  unbound ad neas 
(from  below)  of  vj  and  showing 
the  nonccoputibility  of  the  column 
constraints ; 

«Y 
-U 


(k.c) 


exhibiting  a  feasible  solution  to 
the  column  program  and  the  unbound - 
edness  (from  above)  of  u  ;  and 
shoving  the  nonccaqpatlblllty 
the  row  constraints; 


(k.( 


•bowing  the  nonccopetlbillty 
of  both  row  and  column  constraints. 


(Proof  of  Theorem  4  will  be  given  below) 


(2) 

Pore,  ary  2.  If  there  exist  feasible  solutions  tc  both  row  and  colimn 
prof)  as  then  there  must  exist  opt Inal  solutions. 


Proot  If  there  exist  feasible  solutions  to  both  progress  then  cases  (b), 
(c),  it)  cannot  occur,  and  bancs  ca.a  (e)  must  occur. 


Qo rollary  2  Is  usually  referred  to  as  the  fundamental  duality 
theorem  of  linear  progressing. 


A  simplex  method  for  solving  a  pair  of  dual  linear  programs  (3.2) 
in  canonical  form  is  a  finite  sequence  of  tehees ta  or  equivalent  represents- 
tioos  for  the  progress  obtained  by  successive  pivot  steps,  with  prescribed 
pivot  entry  choloe  rules,  which  obtain  a  schema  exhibiting  optimal  solutions 
to  both  programs,  or  the  ooo-«mt>*tibliity  of  the  row  sad/or  the  column 
constraints . 


8. 


(4) 

A  tow  (or  primal)  simplex  method  is  a  simplex  method  beginning  with 
a  s cheat,  exhibiting  row  feasibility  with  pivot  steps  which  maintain  row 
feasibility  in  each  succeeding  schema.  Thus,  at  any  stage,  a  schema 


(M) 


with  b’  <  0,.../o^  <0  is  at  hand. 


A  row  (or  primal)  pivot  choice  rule  is  as  follows; 

If  a  schema  (?.l)  does  not  exhibit  optimal  solutions  to  both  programs 
(form  4. a)  there  must  exist  a  c*  <  0  for  some  J  .  Either  (i)  every  entry 
in  the  column  of  ej  <  0  is  nonpoeitive  or  (il)  there  exist  positive  entries. 

(1)  lbe  schema  is  in  form  (4.b) 


(li )  Choose  as  pivot  entry  >  0  satisfying 


aaj>° 


Sotlce  that  if  b£  <  0  a  new  row  feasible  solution  is  exhibited  after  pivoting 
which  gives  a  value  to  v  strictly  less  than  the  previous  exhibited  value  of  v. 


Sample.  Solve  the  linear  programs 


(>0) 


5/3  -ty3  -5/3  -5/3 

«u|  1/3*  1/3  2/3  -1/3]  -y4  x3 

1  1-1  2  2  I  -21 «  v  1 

"*3  **2  **5  "*4  "*2  **5  ““ 

(5) 

A  column  (or  dual )  simplex,  method  is  a  duplex  art&KXi  beginning  with  a 
schema  exhibiting  eolusm  feasibility  with  pivct  steps  which  Maintain  ooluan 
feasibility  in  each  succeeding  uchaase.  Ihus,  at  say  stags,  a  sdbsna  (5.1) 
with  >  0,..,,c^  >  0  ,  is  at  hard. 

A  column  (or  dual)  pivot  choice  rule  is  as  follows: 


If  a  schema  (5*1)  does  not  exhibit  optimal  solutions  to  both  progress 
(fora  4. a)  there  Bust  exist  a  b£  >  0  for  ease  i  .  Either  (i)  every  entry 
is  the  row  of  bj  >  0  is  nonnegativ*'  -  (ii )  there  exist  scae  negative 
entries . 

(i)  The  Rehear.  Li  In  fond  (4„e). 

(ii)  Choose  as  pivot  entry  a \ ^  <  0  satisfying 


‘is  *  0  *is  * 

Sotice  that  if  cj  >  0  a  rew  column  feasible  solution  is  exhibited  after 
pivoting  which  gives  a  vain*  to  u  strictly  greater  than  the  previous  ex-, 
hibited  value  of  u  . 


Example.  Solve  the  linear  p: 


y*  1 


”3 

-1 

0 

3 

-4 

-3* 

-1 

6 

-1 

-2 

♦1 

2 

2 

1 

1 

0 

"*1 

"*2 

**3 

-u 

(*  0)  (sax) 


-5/3  '  -1/3  1/3 

4/  j  -1/3  1/3 

5/3  -2/3  5/3 

1 

-2 

-2 

2/3  1/3  W 

2 

-3/5  1/5  -1/5 

ty5  -3/5  3/5 

1-12 

-3/5 

-6/5 

-1 

“275  175  575 

wr 

*5  -*3 
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The  row  and  coluec  simplex  methods  described  above  can  be  applied 

only  to  a  pair  of  proems  whose  schema  exhibit*,  respectively,  a  row  or  a 

colon  feasible  solution.  Tble  is  not  always  the  case,  of  course.  A  aethod 

for  obtaining  a  schema  exhibiting  a  row  feasible  solution  (if  one  exists) 

.  - _______ — — 

can  be  described  as  follows. '  ' 

Suppose  that  the  scheme.  (%l)  does  not  exhibit  a  row  feasible 
solution _  i.e.,  bj  >  0  for  sene  1  .  Assume  that  the  ncopoaltive  b| 
are  b*  <  0, ...,b*  <  0  (this  is  not  a  limiting  assumption  for  the  rows  of 

Xs  E  * 

(5*1 5  could  be  rearranged  to  accomplish  this).  'Then,  by  replacing  the  bj 
with  their  signs,  the  schema  (5.1)  can  be  exhibited  as 


(5.2) 


Kn  • 

^m+n 

1 

4 

e 

*u  * 

•  *  Hn 

e 

tl 

1 

‘  41 

e 

*k+l 

e 

it 

• 

ft  • 

* 

k 

■il  • 

.  .  a‘ 

am 

e 

a 

• 

+ 

e 

0 

e 

=-y  * 

1 

cl  • 

c  * 

•  •  cn 

d* 

=  V 

~  •  "In 

In  (%2)  the  sub-schema  above  the  double  line  can  be  thought  of  as  speci¬ 
fying  a  pair  of  subprograms  in  canonical  form  in  which  a  row  feasible 
solution  obtains.  Therefore,  a  row  2iag>l*x  matuod  pivot  choice  rule  can  be 
used  directly  in  (temporarily)  minimising  -y£+1  subject  to  the  constraints 
as  specified  above  the  -y£fl  row.  If,  aftor  oue  or  toevwrel  pivot  steps, 
the  sign  of  the  entry  corresponding  to  b^+1  be comes  nonpositive,  then 
one  more  row  with  "correct"  sign  has  been  generated  and  there  is  a  larger 
subschema  (perhaps  the  entire  schema)  which  cm  be  thought  of  as  specifying 
a  pair  of  subprograms  in  which  a  row  feaslole  solution  obtains.  Otherwise, 
one  of  the  two  following  forms  can  be  reached  (by  Theorem  4): 


12. 


f 


A  method  for  obtaining  a  column  feasible  solution  (if  one 
exists)  can  be  described  analogously  (the  description  is  left  to  the 
reader). 

f  7) 

6.  Proof  of  the  Main  Theorem. 

Suppose,  as  in  the  statement  fo  the  theorem,  that  a  dual  pair 
of  linear  programs  in  canonical  form  (3*2)  is  given  vlth  m  +  n  ♦  2 
the  number  of  rows  plus  columns.  Before  preceding  to  a  proof  notice 
that  two  corollaries  are  immediate  consequences  of  the  theorem. 


Corollary  3.  Given  a  dual  pair  of  linear  programs  in  canonical  form 
(3.2)  there  exists  •  finite  succession  of  pivot  transformations 
which  obtain  a  representation  for  the  programs  whose  schema  has  exactly 
one  of  the  following  two  forms  (where  the  Identity  of  the  column 
with  label  1  la  not  distinguished): 


(6.e) 


(6.f) 


V 


Proof.  If  (4. a),  (4.b)  or  (4,d)  hold  then  either  (6.e)  or  (6.f) 
obtains.  If  (U.c)  holds,  but  (6.e)  does  not  obtain  then 


obtains)  but  with  one  pivot  step  with  pivot  entry  designated,  (6.4) 
obtains. 


I 
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Corollary  4.  Given  a  dual  pair  of  linear  programs  in  canonical  fern 
(3>2)  there  exists  a  finite  succession  of  pivot  transformations  which 
obtain  a  representation  for  the  programs  whose  schema  has  exactly 
one  of  the  following  two  forms  (where  the  identity  of  the  row  with 
label  1  is  not  distinguished): 


(6.g) 


(6.h) 


or 


•  e  • 


«u 


Proof.  The  proof  is  analogous  to  that  for  corollary  3  and  la 
left  to  the  reader. 


The  proof  of  the  Main  Theorem  le  accomplished  by  induction  on 
the  maber  of  rove  plus  columns.  If  there  is  either  only  one  row 
or  only  one  column  in  (3*2),  one  of  the  forms  of  the  theorem  obtains. 
Suppose,  ther,  the  theorem  true,  and  hence  Corollaries  3  and  4  true, 
for  the  number  of  rows  plus  colians  less  than  a  +  n  ♦  2 


Applying  tbs  inductive  hypothesis  to  (3-2)  with  tbs  last  row 
ignored  we  must  obtain,  by  corollary  4,  either  (after  rearrangement  of 
rows  and  columns  according  to  signs) 


(6.n 


r 


0  ..  0 


1 


«  V 


I 


Iko 

in  a  finite  number  of  pivot  steps.  Notice  that  the  induction  at  this 
point  corresponds  to  obtaining  a  rov  feasible  solution  or  shoving  non® 
exist  (compare  with  the  constructive  procedure  in  Section  5*  end,  in 
particular,  with  (3.3a,l>)). 

If  (6.j)  obtains,  applying  Corollary  3  to  the  part  at  (6.3) 
vbose  row  and  col;  norm  are  bracketed  {and  are  less  than  m  +  n  f  2  in 
masher)  ve  must  obtain  either  fora  (4.c)  or  form  (4.d)  or 

(6.k) 


In  a  finite  number  of  pivot  steps,  where  (6.1c)  contains  one  or  more 
negative  entries  in  the  bottom  rov  pf  the  bracketed  columns .  But  then 
it  Is  possible  to  pivot  on  a  positive  entry  of  the  top  rov  of  the 
bracketed  columns  and  obtain  form  (4.c)  (why?).  Notice  that  the 

induction  at  this  point  corresponds  to  obtaining  a  column  feasible 
solution  or  shoving  none  exist  (compare  with  the  analogous  orstructive 
procedure  mentioned  in  Section  5  for  obtaining  column  feasibility  in 
tbs  bracketed  columns ) . 

If  (6.1)  obtains,  applying  Corollary  3  to  the  part  of  (6.1) 
whose  row  and  columns  are  bracketed  (and  are  less  than  m  +  n  +  2  in 
number)  ve  must  obtain  either  form  (4. a)  or  form  (4.b)  or 


<6.<) 


f 

in  a  finite  number  of  pivot  steps,  where  (6.i)  contains  one  or  acre 
positive  entries  in  the  column  indicated  by  an  arrow.  But  a  row  etqplex 
method  pivot  choice  rule  can  be  applied  to  (6.1)  and  the  value  of  the 
southeast  entry  strictly  decreased  after  pivoting*  Continued  application 
of  a  row  simplex  method  pivot  choice  rule  or  of  the  inductive  hypothesis 
as  applied  to  a  form  (6.1)  then  assures  that  the  value  of  the  southeast 
entry  Is  never  increased  and  is  successively  strictly  decreased 
within  finite  numbers  of  pivot  steps.  Since  there  are  at  most  e  finite 
number  possible  different  schemata  (p-t  most  (*  +  n)  )  and  tha  south¬ 
east  entry  must  successively  decrees;,  e  schema  must  bt  reached  after 
a  finite  number  if  pivot  steps  In  which  application  of  the  pivot  choice 
rule  or  of  the  inductive  hypothesis  is  impossible,  that  is,  form  (4. a) 
or  form  (4.b)  obtains.  Ifotice  that  the  induction  at  this  point 
corresponds  to  obtaining  a  row  feasible  solution  or  ahowlx^  none  exist 
(compare  with  a  row  simplex  method). 

This  completes  the  proof.  Notice,  however,  that  to  every 

application  of  the  inductive  hypothesis  there  correspond*  a  constructive 

conrputntional  set  of  rules  for  choice  of  pivot  entry  which  achieve  the 

sor.e  results.  Th^ee  rules  are  rov  and  column  sijoqplex  method  pivot 

(8) 

choice  rules  applied  to  appropriate  ouhsela  of  row*  and  colimn*.  ’ 


7.  Matrix  Panes 


A  Matrix  gone  A  (or  two-person  zero-sum  game  in  normalized  fora) 
is  specified  by  any  matrix  of  real  numbers 


(7-1) 


with  the  rule  that  in  a  play  of  the  game  players  I  and  II  simultaneously 
choose  same  row  1  (i  -  l,...,m)  and  colwo  J  (j  ■  l,...,n), 
respectively,  with  the  result  that  II  pays  I  an  amount  a  < 


one  column 


A  pure  strategy  for  player  I  is  the  choice  of  seme  one  rov 
of  A  )  a  pure  strategy  for  player  II  is  the  choice  of  some  one  colune 
of  A  .  A  mixed  strategy  for  player  I  is  a  probability  vsetor 
X  »  (Xjj...,^)  ,  xi  •  0  all  i  ,  £  xi  »  1  (in  a  great  many  plays 


of  the 


I  chooses  row  i  with  probability  x^;  a 


for  player  II  is  a  probability  vector  Y  -  (y1#...,yn),  y^  *  0  all  J  , 
J  y^  ■  1  (in  a  great  many  plays  of  the  game  II  chooses  coltaan  J  with 
probability  y^ ) . 

If  X  •  (x^...,*^)  and  Y  -  (y1»..-,yn)  are  any  two  mixed 
strategies  for  players  I  and  II,  respectively,  the  schema 


*1  [  *11 


*ln  *J1 


(7.2) 


-.[v 


17 


conveniently  exbibite,  in  the  coluan  system, 


playera  I 'a  ejected  gains 


(7.2,1) 


*1  *11  + 


*  **  ‘il"8! 


*1  ‘in  * 


"  *n 


•gainst  each  of  II 'a  columns  (or  possit'e  pure  atrataglaa);  and,  in  the 
row  system,  player  II' a  expected  loaaea 


*n  *1  * 


*  *1»  ’n  *  *1 


(7.2,11) 


•  .  y,  +  . . .  ♦  a 

ml  1  ■ 


y  -  1 
'n  a 


•gainst  each  of  I 'a  rove  (or  possible  pure  strategies). 

The  rationale  of  the  theory  of  gsaaas  requires  that  player  I 
choose  his  ainad  strategy  X  so  aa  to  aaxlaiae  his  alnl—  expected 
gain  against  any  choice  of  colts®  by  II,  and  that  player  II  choose  his 
nixed  strategy  Y  so  as  to  liniiixe  bis  waytaua  expected  loss  against 
any  choice  of  rov  by  I.  Letting 

(7<3)  u  -  lin  g.  and  v  •  nex  1 

J  J  i 


this  isans  player  I '•  objective  is  to  choose  an  X  to  eaxialte  u  , 
and  player  II 'a  objective  la  to  choose  a  Y  to  liniilae  v  Clearly 
u  ■  v  Thus  if  an  X  and  a  Y  can  be  found  such  that  u  •  v  they 
ust  constitute  optimal  strategies.  The  ccasoo  value  u  »  v  is  then 
called  the  value  of  the  t —  A 


Ihe  objectives  of  the  playera  aay  be  fonsulated  as  linear 
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program*.  Namely,  player  I1*  objective  ie  to 

Maximize  u 

constrained  by  x  +  . . . 


(7-4,1) 


*1  *11  + 


*1  %  + 

x.  >  0  . 


+ 


+  xm  aml  “  h  *  u 


> 

+  x  a  »  g  ■  u 
m  am  n 


,  x  >0 

D  “ 


and  player  II1*  objective  is  to 

Minimize  v 

cone trained  by 


all  yl 


(7.4,11) 


*»lyl 


+ 

+ 


y! 


♦  yn  "  1 

+  *ln  yn  "  *1  t  v 


a  y  *  J  <  v 
am  n  n  ■ 


,  y  >o 

n  » 


Theae  linear  programa  are  conveniently  exhibited  in  the  schema 

(>  0) 


(7.4)  (>  0)  • 


-v  y. 


u 

0 

-1  ... 

i* 

*1 

1 

’  •  •  • 

»  i.1 

1 

*ln 

* 

• 

*  . 

• 

• 

• 

• 

• 

1* 

4 

:  v  *  •  • 

a 

an 

•1 

'Bi  •  •  • 

•a 

n 

-  -i 

•  -t. 


(<  0) 


-  -t 
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where  the  column  system,  with  "slack*1  variables  flj  "  8j  -u  >  0  , 

refers  to  player  I's  program  of  maximizing  u  ,  and  the  row  system, 
with  "slack"  variables  t  ■  v  -1^  >  0  ,  refers  to  player  II* s  program 

of  minimizing  v 


Two  successive  pivot  transformation  on  the  starred  +1  and  >1 
of  (7 .4)  obtains.  the  representation  for  the  progress  exhibited  in  the 

schema 


(>  0) 


.  (<  0) 


-t 


»-l 

v  (min) 


where 


1J  iJ 


-  a 


In 


■J 


♦  a 


U  i  »>  i  t  *») 


(7.6)  and 


in 


in 


(J  /  n) 


But  (7-5)  exhibits  equivalent  representations  of  the  linear  programs 
(7.^Ij  and  (7-**, II)  as  a  dual  pair  of  linear  programs  in  canonical 
fora-  This  observation  permits  us  to  state 

Theorem  rjK  (Minima*  Theorem).  There  always  axlat  optimal  mixed 
strategies  X  and  Y  for  players  I  and  II,  respectively,  i or 
which  u  •  v  . 
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Proof.  To*  dual  pair  of  linear  progress  (7*4,1)  and  (7.4,11)  each 
have  feasible  solution,  as  is  easily  verified  directly.  Therefore, 
by  Theorem  4, fora  (4. a)  oust  obtain,  establishing  the  theorem. 

To  find  optimal  mixed  strategies  and  the  value  of  the  game  A 

we  need  only  to  solve  the  .ual  paiv  of  linear  programs  exhibited  in  (7*4) 

or  (7*5)  by  applying  a  sjjaplex  method.  Botice,  however,  that  if 

•■n't8*  then  (7,5)  exhibits  a  column  feasible  solution,  while 

if  a  *  ©ax  a  then  (7-5)  e>Jhibit«  a  rov  feasible  solution,  'Riere- 
mn  i  ~n 

fore,  if  the  rows  or  colt  Tana  of  A  ere  rearranged  so  that  one  of 
these  conditions  hold  s  .olumn  or  a  row  simplex  method  can  be  applied 
directly. 


Furthermore,  if 

(7'7)  "J"  V)  ■  So  ■  T  *u 

then  optimal  solutions  obtain  in  (7-5)  with  x  *  1  ,  y  «=  1  and 

a  n 

u  *  v  *  a  that  is,  the  optimal  strategies  are  pure  strategies..  The  entry 
on 

is  called  a  saddle  point,  and  we  can  state 
Theorem  6.  If  akjJ  ia  a  saddle  point  of  A  ,  i.e.,  If 

(7-fl)  *  <  (all  i,j) 

1  -  a*  ..j 

then  optimal  pure  stragles  exsit  with  x^  ■  1  and  y^  *  1  ,  and  the 

value  of  the  game  is  . 

Examples. 

Solve  the  matrix  game  A  specified  by 

0-1  1-1  7 
A  -  ’7  1  *3  2-8 

3  <D  2  6  3 

2  ©  3  4  5 
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The  circled  entries  are  saddle  points;  therefore  player  I  will 
use  pure  strategy  row  3  or  row  4  (or  any  nixed  strategy  using  only 
a  combination  of  rows  3  and  4),  and  player  II  will  use  pure  strategy 
coluan  2  The  value  of  the  game  is  2  . 

Solve  the  antrix  game  specified  by 

'4  1.4  2-3 

,  -2  3  2  3  2 

A  * 

-1-2113 

-2-1102 


Notice/  first,  that  rov  2  dominates  row  4  it  the  sense  that  player  I 
can  on!/  gain  more  by  playing  row  2  rather  than  row  4;  moreover/  given 
thet  player  I  never  plays  row  4  ,  column  2  dominates  coluan  4 
and  column  3  dominates  coluun  5/  hence  player  II  should  never  play 
column  4  or  column  5  •  Thus  ■  0  and  y^  ■  ■  0  in  optimal 

mixed  strategies.  Rearrange  rows  and  columns  in  A  with  row  4  and 
columns  4  and  5  omitted  so  that  the  southeast  entry  is  a  maximum 
In  its  column  and  construct  the  schema  corresponding  to  (7.4)  and 
pivot  as  Indicated:  ol 

maximize  u 

minimize  v 

(>  0) 


Thus  optimal  mixed  strategies  are 

X  «  (l/4,  1/4,  1/2,  0)  and  Y  -  (l/2,  0,  l/2,  0,  0) 

and  the  value  of  the  game  is  zero  (it  Is  fair).  Notice,  however,  that 
another  pair  of  optimal  mixed  strategies  can  be  found  by  pivoting  as 
indicated  to  obtain 

*2  y2  *1 _ ]_ 

*3  p7a  l/i  -l/l2 1 -1/2 1"  -y 

x3  -5/6  -28/6  -l/6  0  «■  -t3 

•1  [li ll _ =  -yx 

1  L2/3  14/6  1/3  0  1-  v 

**2  °e2  MX1 


Thus  optimal  mixed  strategies  are 

X  •  (l/3*  2/3,  0,  0)  and  Y  »  (l/8,  0,  l/2,  0,  0). 
Therefore 

X  «  a( l/3,  2/3,  0,  0)  +  (l-a)  (1/4,  1/4,  1/2,  0),  0  <  a  <  1 

constitutes  an  optimal  mixed  strategy  for  player  I  for  any  choice 
of  a  between  tero  and  one.  (Why?). 


Footnotes 


(1) 


(2) 


(3),(4) 


(5) 


(6) 


(7), (8) 


(9) 


(10) 


Also  known  as  complete  elimination,  or  G»ubs -Jordan 
elimination.  See:  E.  Stiefel,  "Note  aa Jordan  Elimination, 
Linear  Programing,  and  TeLebycleff  Approximation", 
Numerlache  Mathematik  vol.  2  (i960),  pp.  1-17* 


Corollary  2  is  known  as  the  "fund  mental  duality  of 
lineal  programming".  Its  first  explicit  statement  is 
contained  in  D.  Gale,  H.  W.  Kuhn,  and  A.W.  Tucker,  "Linear 
Programing  and  the  Theory  of  Games",  in  Activity  Analysis 
of  Producatlon  and  Allocation,  (tilted  by  T.  C.  Koopmans) , 
John  Wiley  and  Sons,  Inc.,  New  York,  1951*  The  notion  of 
a  duality  theory  arose  from  the  Minimax  Theorem  of 
J.  von  Neumannjsee  footnote (10) ). 


The  discovery  of  a  (primal)  simplex  method  (1947)  le  due 
to  George  B.  Dantzig.  His  original  paper  "Maximization  of 
a  Linear  Function  of  Variables  Subject  to  Linear  Inequalities" 
is  contained  in  Activity  Analysis  of  Production  and  Allocation. 


A  dual  simplex  method  was  first  expliclty  advanced  by 
C.  C.  Leake  in  "The  Dual  Method  of  Solving  the  Linear 
Programing  Problem",  Naval  Research  Logistics  Quarterly. 
vol.  1,  (1954),  pp.  36-37. 


This  method  is  described  by  M.  L.  Balinski  and  R.  E.  Garnery 
in  "A  Mutual  Primal-Dual  Simplex  Method"  to  appear  in 
Proceedings  of  Symposium  on  Mathematical  Programming  held 
in  Chicago,  June,  1962. 


The  proof  given  here  depends  on  the  basic  idea  advanced 
in  the  paper  referred  to  in  (6),  where  the  proof  is  completely 
constructive.  The  inductive  proof  found  here  le  due  to 
A.  V.  Tucker. 


The  formulation  of  the  players'  problems  as  linear  program 
given  here  follows  A.  W.  Tucker's,  "Solving  a  Matrix  Gam 
by  Linear  Programing",  IBM  Journal  of  Research  and  Develop¬ 
ment,  voi.  k  (i960),  ppT3o7^5rT - 


The  Minimax  Theorem  was  discovered  and  first  proved  by 
J.  yon  Nfumann.in  "£ur  Theorie  der  Oesellschaftsspiele", 
Matbeaatische  Annalan,  vol.  100  (1926),  pp. 295-320. 
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DUALITY  IN  NONLINEAR  PROGRAMMING 
R,T.  Rockafellar* 

1.  Introduction 

A  nonlinear  program  in  n  variables  is  usually  described 
as  a  problem  of  minimizing  (or  maximizing)  a  quantity  fQ(x) 
subject  to  constraints  f^(x)  <  0,...,fffi(x)  <  0,  where 
f  , ...,f  are  certain  real-valued  functions  of  the  vector 
x  =  (x^,...xn)  e  Rn.  The  problem  may  be  interpreted  broadly 
or  narrowly,  however. 

In  the  narrower  sense,  one  is  only  interested  in  the 
infimum  of  a  certain  function  given  on  a  subset  S  of 
Rn.  The  elements  x  of  the  subset  S  are  the  so-called 
feasible  solutions  to  the  problem.  Typical  questions  are 
the  following.  Is  the  infimum  finite?  Dc  there  exist 
o:  timal  solutions,  i.e.  feasible  solutions  at  which  the 
infimum  is  attained?  Is  there  only  one  optimal  solution? 

One  seeks  conditions  which  guarantee  "  yes "  answers  to  these 
questions,  as  well  as  algorithms  for  actually  computing  the 
infimum  and  optimal  solutions. 

In  the  broader  sense  of  the  problem,  one  is  also  concerned 
with  the  sensitivity  of  the  infimum  and  optimal  solutions  to 
slight  changes  in  the  constraints.  This  is  where  duality  and 
La  range  multipliers  cone  in.  Let  p(u^ , . . . ,uffi)  denote  the 
infimum  of  f  (xl  subject  to  f,  (x)  <  u,  (x)  <  u  . 

•This  work  was  supported  in  part  by  grant  AF-AKOSR-1 POP-67 
from  the  Air  Force  Off i ce  of  Scientific  Pru'-nrch. 
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One  is  interested  in  the  properties  of  p  as  a  function  of 
the  perturbation  u  *  (u^,...,um)  e  Rn  near  u  =  0.  For 
instance,  is  p  continuous  or  di fferentiable  at  u  =  0? 

It  is  especially  important  to  look  for  numbers 
U2*,...,um*  such  that 

( 1  •  1)  p(ux ,  •  •  •  ,  u^)  ^  *^l”*  * ;  —  ^m*^ni  *  ^  .  ,  u^)  tR  . 

Such  numbers  can  be  interpreted  as  "  equilibrium  prices,"  if 

the  objective  function  fQ  is  interpreted  as  a  cost  function. 

Suppose  that  in  trying  to  minimize  cost  we  are  allowed  to 

perturb  the  given  problem  by  any  amount  (u^,...,u  ),  but 

that  this  perturbation  must  be  paid  for,  the  price  being 

u^*  per  unit  of  variable  u^.  The  minimal  cost  attainable 

in  the  problem  perturbed  by  (u^,...,u  ),  plus  the  cost  of 

this  perturbation,  is 

p(u,,...,u  )  ♦  u.’u,*  ...  +u  *u. 
l*  *  m  11  mm 

If  the  prices  satisfy  (1.1),  this  is  never  less  than  t:  e 
minimal  cost  p(0,...,0)  in  the  unperturb  d  problem,  so  all 
the  incentive  for  perturbation  is  neutralized  and  there  is 
an  "  equilibrium". 


Observe  that  (1.1)  is 

satisfied  if 

and  on’. y  if 

f0(x)  ♦  Uj'Uj. 

*  *u  *  i  of 

-»•••» 0) 

for 

every  choice  of  x  an- 

(«!,.. .,uB) 

sue:;  that  f. 

i 

for 

i  -  l,...,m.  Assuming 

p(0, , , , ,0) 

is  finite,  tr.i 

condition  is  equivalent  to 

the  condition 

that  u*  >  C 

i  - 

1  «  1 , . . . ,o  and 

fQ(x)  ♦  Uj'fjCx)*  ...  ♦unTn(x)  ^  r(0t...,0\  V*  - 


In  other  words,  the  equilibrium  prices  are  the  same  as  the 

non-negative  Lagrange  multipliers  u^ u  •  such  that  the 

unconstrained  infimum  of  f  +  u,  •  f,  +  ...  ♦um*ftB  coincides 

0X1  mm 

with  the  infimum  of  f  subject  to  the  constraints 

^(x)  <0,  ...  »fm(x)  <  0. 

These  reflections  on  the  nature  of  a  classical  nonlinear 
program  lead  us  to  propose  a  new  concept  of  a  generalized 
nonlinear  program  as,  not  just  a  single  minimization  problem, 
but  a  minimization  problem  with  a  built-in  class  of  perturbations . 

In  such  a  program,  one  is  to  study  not  only  the  infimum  in  the 
problem  corresponding  to  zero  perturbation,  but  also  the  sensitivity 
of  the  infimum  with  respect  to  perturbations  to  neighboring 
problems.  The  Lagrange  multipliers  are  to  be  the  "  equilibrium 
prices"  for  the  perturbations. 

Suppose  that  for  each  vector  u  e  R®  we  are  given  a  pair 
(Su,  Fu) ,  where  Su  is  a  subset  of  Rn  (possibly  empty) 
and  Fu  is  a  function  on  Su  with  values  in  C-®  ,  *00  J. 

The  correspondence 

F:  u  -  (Su,  Fu) 

will  be  called  a  bifunction  from  R®  to  Rn.  A  bifunction 
is  to  be  regarded  as  a  generalization  of  "  multivalued  mapping"  : 
the  image  of  u  under  F  is  not  just  a  set,  but  a  set  with 
a  distinguished  function  attached  to  it.  One  can  interpret 
the  function  Fu  as  assigning  a  relative  value  or  cost 
'Fu)(x)  to  each  element  x  of  the  set  Su. 
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Por  any  bifunction  F  from  Rm  to  Rn,  we  define  a 
generalized  program  (P):  minimize  the  function  FO  on  the 
set  SO.  The  problem  is  to  include  the  local  analysis  of 
the  properties  of  the  function  p  *  inf  F  at  u  *  0,  where 
(inf  F)(u)  «  inf  ^(Fu)(x)  I  x  t  Su 
(By  convention,  an  infimum  is  +®  if  the  set  over  which  it 
is  taken  is  empty.)  A  vector  x  e  Rn  will  be  called  an 
optimal  solution  to  (P)  if  (inf  F^(0)  is  finite  and 
attained  at  x.  If  (inf  F)(0)  is  finite,  we  define  a 
Lagrange  multiplier  vector  for  (P)  to  be  a  vector  u*  e  Rm 
such  that 

(inf  F)(u)  ♦  <u*,u>  >  (inf  F)(0) 
for  every  perturbation  u  e  Rm.  (Here  <  •,•  >  denotes  the 
ordinary  inner  product  of  two  numerical  vectors.) 

Under  simple  convexity  assumptions  on  the  bifunction 
F,  a  comprehensive  duality  theory  is  possible  for  such 
generalized  programs,  as  will  be  explained  below.  A  dual 
program  (P*)  may  be  constructed  which  is  of  the  same  type, 
except  that  it  involves  maximization  rather  than  minimization. 
The  dual  of  the  propram  (P#)  is  in  turn  (P).  The  extrema  in 
(P)  and  (p*)  are  generally  equal.  Ahe  ort;oal  solutions 
to  (P*)  are  g  nerally  the  Lagrange  multiplier  vectors  for 
(P),  while  the  optimal  solutions  to  (P)  are  the  Lagrange 
multiplier  vectors  for  (F*).  The  pairs  of  optimal  solutions 
* '  (P)  ani  (?*)  are  the  ^addle-points  of  a  certain 


Kuhn-Tucker  function. 
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An  intriguing  mathematical  feature  of  the  theory  to  he 
explained  is  that  it  constitutes  a  new  M  convex  algebra” 
closely  parallel  to  linear  algebra.  The  convex  bifunction 
F  plays  a  role  analogous  to  that  of  a  linear  transformation. 
Duality  is  obtained  by  the  construction  of  an  adjoint  bifunction 
F*  in  terms  of  Fenchel's  conjugacy  correspondence.  Whereas 
a  linear  transformation  and  its  adjoint  are  related  by  a 
bilinear  function,  a  convex  bifunction  and  its  adjoint  are 
related  b,  a  convex-concave  function,  and  the  formula 
<Fu,  x*>  =  <u,  F*x*>  appears  as  an  ”  inf=sup"  theorem  for 
a  dual  pair  of  programs.  Minimax  theory  is  associated  with 
the  ”  inverse”  operation  for  bifunctions. 

The  results  in  this  paper  are  based  on  the  general 
theory  of  convex  functions  and  especially  on  the  very  important 
notion  of  conjugacy  due  to  Fenchel  Cl?].  The  elementary  facts 
about  convex  functions  aic  r-viewed  in  $2.  Further  details 
can  be  found  in  the  works  of  Fenchel,  Br^ndsted,  Moreau 
and  Rockafellar  listed  among  the  references. 

The  complete  proofs  of  the  new  duality  theorems  and  of 
the  theorems  about  bifunctiona  are  all  contained  in  a  forth¬ 
coming  book  i>h] .  home  of  the  main  i leas  have  already 
appeared  ir.  other  papers  cl  the  author,  however.  A  perturbaticnal 
a;  proech  to  duality  theory  is  given  in  [«5]  and  Cj8].  The 
correspondence  between  concave-convex  functions  on  Rn  X  Rn 
and  convex  functions  or.  Rr,'rn  (here  the  oraph  functions  of 
convex  bi functions)  is  established  in  [ .  A  similar 
"  convex  altrebra”  for  multivalued  aur  pings  has  been  develo}  ed 
ir.  (56]  and  described  in  (5?]. 
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Some  applications  of  Fenchel's  theory  to  general  nonlinear 
programming  have  also  been  described  by  Ghouila-Houri  [23, 
Dennis  [7],  Dieter  [8,93,  Falk  and  Thrall  [1^3,  Karlin  [233  , 
and  Whinston  [463. 

An  excellent  discussion  of  general  Lagrange  multipliers 
as  "  equilibrium  prices"  has  been  given  by  Gale  [193  in  the 
case  of  concave  maximization  problems  depending  outside 
parameters. 

2.  Convex  functions  and  their  conjugates. 

The  object  of  the  finite-dimensional  theory  of  convex 
functions  is  the  study  of  pairs  (C,f>,  where  C  is  a 
non-empty  convex  set  in  Rn  and  f  is  a  real -valued  cor  vex 
function  on  C»  i.e.  a  function  from  C  to  R  satisfying 

(2.1)  f(3CL-\)x+\y)  <  (1-/0  f(x)  f  A  f(y),  0  <  X  <  1, 

for  any  x  e  C  and  y  s  C.  There  are  technical  advantages, 
however,  in  representing  each  suen  pair  by  a  function  which 
is  defined  on  all  of  Rn  but  which  may  have  infinite  values, 
namely  the  function  obtained  by  defining  f(x)  to  be 
+<©  for  x  t  C. 

In  general,  let  f  be  any  function  defined  on  all  of 

Rn  and  having  values  which  are  real  numbers  or  +®  .  The 

epigraph  of  f,  denoted  by  epi  f,  is  the  set  of  pairs 
(x,u)  in  Rn+1  such  that  x  e  Rn,  y,  c  R  and  ji  >  f(x). 

(Thus  epi  f  can  be  regarded  as  the  set  of  all  "  finite" 
points  lying  on  or  above  the  graph  of  f.)  Ve  define  f 
to  be  a  convey  function  on  Rn  if  epi  f  is  convex  as  a 
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subset  of  Rn+1.  If  there  is  no  x  such  that  f(x)  »  -qd  , 
this  definition  of  convexity  is  equivalent  to  inequality 
(2.1)  being  satisfied  throughout  Rn  (with  the  obvious 
rules  for  manipulating  +a>). 

If  f  is  convex,  the  set 

dom  f  -  ^x  I  f(x)  <  +od"^-, 

which  is  the  projection  of  epi  f  on  Rn,  is  convex;  it 
is  called  the  effective  domain  of  f.  A  convex  function 
f  on  Rn  is  said  to  be  proper  if  dom  f  is  non-empty 
and  f  is  finite  on  dom  f,  in  other  words  if  f  is  not 
the  constant  function  +oo  and  there  is  no  x  such  that 
f(x)  =  -cd  .  The  restriction  of  f  to  C  *  dom  f  is  then 
a  pair  (C,f)  of  the  type  mentioned  above,  and  every  such 
pair  arises  in  this  way.  Thus  the  study  of  the  pairs  (C,f) 
is  replaced  by  the  study  of  the  proper  convex  functions 
f  on  Rn. 

Convex  functions  which  are  not  proper  can  arise 
naturally  as  the  result  of  certain  operations,  and  they  do 
have  some  technical  uses.  The  fundamental  fact  about  an 
improper  convex  function  f  on  Rn  is  that  f  must  be 
identically  -®  on  the  interior  of  dom  f. 

A  useful  example  of  a  convex  function  is  the  indicator 
function  $(*IC)  of  a  convex  set  C  in  Rn,  where  S(xiC)  =  0 
for  x  e  C  and  S(xiC)  =  +od  if  x  t  C.  If  fQ  is  a  finite 
convex  function  on  Rn,  the  convex  function  f  *  f0+  $(*IC) 
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agrees  with  f  on  C  and  is  +od  elsewhere.  Minimizing 
f  on  C  is  equivalent  to  minimizing  f  over  all  of  Rn. 

We  shall  use  this  device  to  re-express  all  constrained 
extremum  problems  as  formally  unconstrained  problems. 

Let  f  be  a  convex  function  on  Rn,  and  let  D 
denote  the  collection  of  al1  nairs  (x*,  p*)  such  that 
x*  e  Rn,  p*  e  R  and 

f(x)  >  <x,  x*>  -  p*,  Vx  e  Rn. 

The  pointwise  supremum  of  the  corresponding  collection 
of  affine  functions  h(x)  =  «x,  x*>  -  u*  is  called  the 
closure  of  f  and  is  denoted  by  cl  f.  Thus  by  definition 

(2.2)  (cl  f)(x)  =  sup  |  <x,  x*>  -  u*  I  (x* ,p*)  e  . 

When  cl  f  =  f ,  one  says  that  f  is  closed.  If  f  is 
proper,  it  can  be  shown  that  the  epigraph  of  cl  f  is  simply 
the  closure  in  Rn+1  of  the  epigraph  of  f.  Then  cl  f  is 
a  closed  proper  convex  function  on  Rn,  and 

(2.3)  (cl  f)(x)  =  lim  inf  f(y) ,  Vx  e  Rn. 

y-x 

In  particular,  a  proper  convex  function  is  closed  if  and  only 
if  it  is  lower-semicontinuous ,  i.e.  has  the  property  that  the 
convex  level  set  ^x  J  f(x)  <  is  closed  in  Rn  for  each 
real  p. 

For  a  proper  convex  function  f,  (cl  f)(x)  must 
actually  coincide  with  f(x)  for  every  x  in  the  interior 
of  dom  f  or  outside  the  closure  of  dom  f.  Thus  f  -*  cl  f 
may  be  regarded  as  a  regularizing  operation  which  simply 
redefines  f  at  certain  boundary  points  of  its  effective 
domain,  so  as  to  make  f  lower-semicontinuous.  For  an 
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improper  convex  function  f,  cl  f  is  the  constant  function 
-od  or  the  constant  function  +a>  ,  depending  on  whether  or 
not  dom  f  is  non-empty. 

Fenchel's  important  notion  of  conjugacy  is  obtained  by 
further  consideration  of  the  set  D  introduced  above.  Clearly 
D  consists  of  the  pairs  (x*,u*)  in  Rn+^  such  that 
u*  >  f *(x*) ,  where 

(2.4)  f*(x*)  =  supx{<x,x*>  -  f(x)^  . 

Thus  D  is  the  epigraph  of  a  certain  function  f*  on  Rn. 

This  f*  is  called  the  conjugate  of  f. 

It  can  be  se en  that  f*  is  a  closed  crivex  function  on 
Rn,  proper  if  and  only  if  f  itself  is  proper.  The  conjugate 
f**  of  f*  is  in  turn  given  by 

f**(x)  =  supx„  ^  <x,x*>  -  f*(x*)*y  . 

But  this  supremum  is  the  same  as  the  supremum  in  (2.2)  Thus 
f**  s  cl  f.  In  particular,  if  f  is  closed  it  is  the 
conjugate  of  its  conjugate  f*. 

Conjugacy  therefore  defines  a  one-to-one  symmetric 
correspondence  in  the  class  of  all  closed  convex  functions 
on  Rn. 

As  an  example,  the  conjugate  of  the  indicator  function 

S  (•  1C)  of  a  convex  set  C  in  Rn  is  given  by 

S*(x*IC)  *  sup  •(  <x,x*>  -  S(xlC)\  «  sup  <x,x*>. 
x  v  *  xcC 

The  function  S * ( • * C)  is  called  the  support  function  of  C. 

A  convex  function  f  on  Rn  is  necessarily  continuous 
on  the  interior  of  its  effective  domain.  It  is  differentiable 
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almost  everywhere  on  any  open  set  where  it  is  finite. 

Assume  that  x  is  any  point  where  f  is  finite. 

The  (one-sided)  directional  derivative 

(2.5)  f'(x;y)  -  lim  -  f(*) 

\io  K 

exists  and  is  a  convex  function  of  y  (possibly  with  the 
valuer  +oo  ).  Of  course,  if  f  is  actually  differentiable 

at  x,  we  have 

(2.6)  f’(x;y)  .  <Vf(x),  y>, 

where  Vf(x)  is  the  gradient  of  f  at  x, 

W*>  *  (  M  (x) . Cx)). 

3*1  5xn 

If  f  is  not  differentiable  at  x,  the  directional  derivatives 
can  still  be  described  in  terms  of  "  subgradients"  .  A 
subgradient  of  f  at  x  is  a  vector  x*  e  Rn  such  that 
f(z)  >  f(x)  +  <z— x ,  x*>,  \/z  e  Rn. 

The  set  of  subgradients  x*  at  x  is  a  certain  closed 
convex  (possibly  empty)  set  denoted  by  ^f(x).  The  case 
where  ^f(x)  consists  of  Just  one  x*  is  precisely  the 
case  where  f  is  finite  and  differentiable  at  x,  the 
unique  subgradient  then  being  V?(x).  It  can  be  shown  tnat, 
if  x  is  actually  an  interior  point  of  dom  f,  ^f(x)  is 
non-empty  and  compact,  and 

(2.7)  f'(x;y)  -  max^<x*,y>  I  x*  e  V(x)^,  *  S*(y*  df(x)) 
for  each  y  e  Rn.  In  general,  ^f(x)  is  eirpty  and  only  if 
f’(xjy)  -  -®  for  some  y. 
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Vhen  <Jf(x)  is  non-empty,  one  necessarily  has 
(cl  f)(x)  *  f(x).  On  the  other  hand,  when  (cl  f)(x)  *  f(x) 
one  has  x*  e  ^f(x)  if  and  only  if  x  e  «^f*(x*).  Thus 
the  multivalued  mapping  7&f*:  x*  -*  is  the  inverse 

of  the  multivalued  mapping  ^f:  x  —  ^tf(x),  when  f  is  a 
closed  proper  convex  function. 

Note  that  0  e  &f(x)  if  and  oniy  if  f  attains  its 
minimum  at  x.  We  shall  use  this  fact  later  in  a  slightly 
different  form:  when  (cl  f)(0)  «  f(0),  the  vectors  x* 
in  ^f(O)  are  the  same  as  those  for  which  0  eV(x'), 
i.e.  for  which  f*  attains  its  minimum. 

The  conjugate  of  a  differentiable  convex  function 
f  on  Rn  is  closely  relat id  to  the  Legendre  transform  of 
f.  Let  C*  be  the  ret  of  all  gradients  x*  of  f,  i.e. 
the  image  of  Rn  urder  the  mapping  x  -*  'Vf(x).  tfiven  any 
x*  e  C*,  the  vectors  x  for  which  the  supremum  in  (2.4) 
is  attained  are  precisely  those  for  which  x*  ■  ^(x);  thus 
(2„8)  f*(x*)  ■  <x,x*>  — f ( x)  when  x*  *  ^Jt(x). 

If  the  mapping  Vf  is  one-to-one,  we  get 

(2.9)  f(x*)  -  <(  VO"1(x*).x*>  -f((Vf)“1(x*)),  x*  e  C*. 
This  is  the  formula  for  the  Legendre  transform  of  f. 

If  Vf  is  not  one-to-one,  we  can  still  conceive  of 
parameterizing  C*  in  terms  of  x  by  moans  of  the  nonlinear 
substitution  x*  -  the  substitution  yields  the  formula 

(2.10)  f(Vf(x;)  -  <x,  Vf(x)>  -f(x). 

This  function  of  x  is  one  which  is  common  in  the  literature 
of  nonlinear  programming.  It  is  generally  not  convex,  of 
course,  and  it  generally  does  not  express  f*  completely. 
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The  set  C*  need  not  be  convex  in  R11,  and  there  may  be 
points  outside  of  C*  where  f*  is  finite  and  the  Legendre 
transform  is  undefined. 

It  will  be  convenient  in  what  follows  to  place  concave 
functions  on  an  equal  footing  with  convex  functions.  A 
function  g  from  Rn  to  C-a\  +cd  is  said  to  be  concave, 
of  course,  if  f  ■  -g  is  convex.  Ail  the  above  facts 
and  definitions  for  convex  functions  have  obvious  analogues 
for  concave  functions,  in  which  the  roles  of  +ao  ,  inf  and 
<  are  interchanged  with  those  of  -a>  ,  sup  and  >.  In 
particular,  the  conjugate  of  a  concave  function  g  is 
defined  by 

6* (x* )  »  inf^|<x,x*>  -g(x)^.  . 

It  should  be  noted  that  g*  is  not  the  same  as  -f*,  where 
f  =  -g.  Instead  one  has  g*(x*)  =  -f*(-x*). 

5.  Dual  programs  and  adjoint  bifunctions. 

By  a  convex  bifunction  from  Rm  onto  Rn,  we  shall 
mean  a  correspondence  F  which  assigns  to  each  u  e  Rm 
a  function  Fu  from  Rn  to  [-ao  ,  +ac  ],  such  that 
(Fu)(x)  is  a  (jointly)  convex  function  of  (u,x)  on 
Rm+n.  This  function  on  Rm+n  is  called  the  p-raph  function 
of  F.  We  shall  say  F  is  closed  or  proper  according  to 
whether  its  graph  function  is  closed  or  proper.  The  effective 
domain  of  F  is  defined  to  be  the  (convex)  projection  on 
Rm  of  the  effective  domain  of  the  graph  function,  i.e. 


dom  F 
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-^u  I  3x,  (Fu)(x)  <  +0D^  . 

If  F  is  closed,  proper  and  convex,  then  in  particular 
Fu  is  a  closed  convex  function  on  Rn  for  every  u,  proper 
for  u  e  dom  F  but  identically  +qd  for  u  t  dom  F. 

For  example,  let  fQ,  flf...,f  be  finite  convex 
functions  on  Rn,  and  for  each  u  =  (u^,...,u  )  define 
the  function  Fu  by 

(5.1)  (Fu) (x)  -  ffQ(x)  if  fx(x)  <  ulf...,fm(x)  <  um, 

\+o>  if  not. 

It  is  easily  demonstrated  that  F  is  a  closed  proper  convex 
bifunction.  Note  that  dom  F  consists  of  the  vectors  u 
such  that  the  corresponding  inequality  system 

flW  <  ui . Vx)  S  “a 

has  at  least  one  solution  x. 

For  another  example,  let  A  be  a  linear  transformation 
from  Rm  to  Rn  and  let 

(3.2)  (Fu)  (x)  =/  0  if  x  *  Au, 

l+a>  if  x  /  Au. 

This  F  is  a  closed  proper  convex  bifunction  which  we  call 
the  indicator  bifunction  of  A.  We  shall  see  that  the 
"  convex  algebra"  below  reduces  to  ordinary  linear  algebra 
when  the  bifunctions  are  taken  to  be  such  indicator  bifunctions. 

Henceforth  we  assume  for  simplicity  that  F  is  a 
certain  closed  proper  convex  bifunction  from  R®  onto  Rn. 

The  program  (P)  associated  with  F,  as  in  the  intro¬ 
duction,  is  that  of  minimizing  FO  on  Rn.  Of  course, 
minimizing  FO  on  Rn  is  equivalent  to  minimizing  FO 
over  the  convex  set  dom  (FO),  since  FO  has  only  the 
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value  +od  outside  this  set.  The  elements  of  dom  (FO) 
will  be  called  the  feasible  solutions  to  (P).  This  is 
suggested  by  the  case  of  (P)  where  F  is  given  by  (3.1), 
which  we  refer  to  as  the  case  of  a  classical  convex  program. 
Feasible  solutions  to  (F)  exist  if  and  only  if  0  e  dom  F, 
in  which  event  we  say  (P)  is  consistent.  If  0  is  actually 
an  interior  point  of  dom  F,  we  say  (P)  is  strictly 
consistent.  In  the  classical  case,  (P)  is  strictly  con¬ 
sistent  if  and  only  if  there  exists  an  x  such  that 
fA(x)  <  0  for  i  ■  l,...,m. 

The  fundamental  and  easily  proved  fact  on  which  our 
analysis  of  (P)  depends  is  that  the  function  inf  F,  where 

(inf  F)(u)  *  inf(Fu)  *  infx(Fu)  (x), 
is  a  convex  function  on  Rm  whose  effective  domain  is  the 
same  as  dom  F.  The  theory  of  closures,  conjugates, 
directional  derivatives  and  subgradients  of  convex  functions 
can  therefore  be  applied  to  the  study  of  inf  F  at  u  -  0. 

For  example,  if  (P)  is  strictly  consistent,  0  is 
in  the  interior  of  the  effective  domain  of  inf  F,  so  we 
may  conclude  at  once  that  (inf  F)(u)  depends  continuously 
on  u  for  sufficiently  small  perturbations  u. 

Assume  that  (inf  F)(0)  is  finite.  By  definition, 
u*  is  a  Lagrange  multiplier  vector  for  (P)  if  and  only  if 
(inf  F) (u)  >  (inf  F)  (0)  -  <u,u*>,  e  R°, 
in  other  words  if  -u*  is  a  subgradient  at  0: 

(3.3)  -u*  c  d(inf  F)(0) . 
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If  (P;  is  strictly  consistent,  so  that  0  is  an  interior 
point  of  dom  (inf  T  ,  we  know  from  the  general  theory 
that  ^(inf  F)(0)  is  a  non-empty  compact  convex  set  in 
R“  whose  support  function  is  the  directional  derivative 
function 

(3.4)  (inf  F)'(0;  u)  -  lim  £ipf  F)-C P-). 

KiO  K 

In  particular,  a  Lagrange  multiplier  vector  u*  «  (uj,...,u£) 
does  exist  when  (P)  is  strictly  consistent.  This  u*  is 
unique  if  and  only  if  inf  F  is  actually  differentiable  at 
0,  in  which  case  one  has 

(3.5)  uj  =  (inf  F)(0),  i  =  l,..,m. 

(Thus,  for  example,  in  a  classical  convex  program  the 
Lagrange  multipliers,  if  unique,  give  the  rates  of  change  of 
the  infimum  with  tespect  to  changes  of  the  constant  terms  in 
the  corresponding  constraint  inequalities.)  By  the  general 
theory  of  subgradienta ,  a  Lagrange  multiplier  vector  fails 
to  exist  for  (P)  if  and  only  if  there  exists  a  u  such 
that  (inf  F)'(o;  u)  -  .  The  interpretation  of  this  case 

is  that  there  is  some  direction  of  perturbation  in  which  the 
"minimal  cost"  drops  off  infinitely  steeply,  so  that  10 
finite  "prices"  for  the  perturbation  variables  can  bring 
about  a  state  of  equilibrium. 

To  get  the  program  dual  to  (P),  we  need  to  introduce 
the  adjoint  of  F.  This  is  the  bi function  F*  from  Rn 
onto  R®  given  by  x*  -  F*x*, 


where 
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(3.b)  (F*x*)(u*)  »  inf  ^(Fu)(x)-<x,x*>+<u,u*>  |>  . 


u,x 


Note  that,  for  the  graph  function  f  of  F,  one  has 

(F*x*)(u*)  ■  -sup ^ <u,-u*>+<x,x*>-f (u,x)^-  *  -f*(-u*,x*), 


u,x 


where  f •  is  the  conjugate  of  f  on  Rtt+n.  Thus  F*  is  a 

closed  proper  concave  bifunction  in  the  obvious  sense. 

The  adjoint  of  a  concave  bifunction  is  defined  in  the 

same  way,  except  of  course  that  "  sup”  replaces  "  inf"  . 

Thus  the  adjoint  F**  of  F*  is  defined  in  turn  by 

(PMu)(x)  -  sup  f(F*x*)(W)  ~<u,u*>+<x,x*>^ 
x* ,u* v 

sup  yCu,u*>+<x,x*>-f  *(u*  ,x*)f  -  f**(u,c). 

,»  V*'1  J 


u’  ,x" 


Since  t**  ■  f  under  the  conjugacy  correspondence,  we  have 


-  F. 


It  is  easy  to  see  that,  when  F  is  the  convex  indicator 
bifunction  of  a  linear  transformation  A  from  Rm  to  Rn,  F* 
is  the  conc-ive  indicator  bifunction  of  the  adjoint  linear 
transformation  A*  from  Rn  back  to  Rm  (corresponding  to 
the  transpose  matrix),  i.e.  (F*x*)(u*)  is  o  if 
u*  -  A*  x*  and  -a>  if  u*  t  A*  x*.  In  this  sense,  the 
adjoint  operation  for  bifunctions  generalizes  the  one  for 
linear  transformations.  Further  justification  of  the  "adjoint" 
terminology  will  be  given  in  the  nest  section. 

We  define  the  dual  program  (P*)  to  be  thtt  of 
maximizing  the  concave  function  F*0  on  Rm,  In  (F*) 
we  are  also  interested  in  the  properties  of  the  function 
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sup  F*  at  x*  =  0,  where 

( sup  F’Hx*)  «  3UP  (F*x*)  -  SUPu,(F*X*)(u*). 

Thus  x*  is  taken  to  be  the  perturbation  variable  in  (P*)» 
while  u*  is  the  vector  variable  over  which  one  maximizes. 
Of  course,  sup  F*  turns  out  to  be  a  concave  function 
on  Rn.  All  of  what  we  have  just  said  about  inf  F  in  (P) 
applies  to  sup  F*  in  (Pa)  with  only  the  obvious  changes. 
The  dual  of  (P*)  is  in  turn  (P) ,  inasmuch  as  F**  =  F. 

As  an  example,  let  A  be  a  linear  transformation  from 


Rn  to 

Rm,  fix  a  c  R°  and 

a*  e  R11, 

and 

set 

(3.7) 

(Fu)  (x)  =i<x,a*> 

if  x  >  0 

and 

Ax 

1  +qd  if 

not. 

(This  is  the  case  of  (3.1)  where  the  functions  f^  are  all 
affine.)  Minimizing  FO  in  (P)  is  then  the  same  as 
minimizing  <x,a*>  subject  to  x  >  0  and  Ax  >  a,  an 
ordinary  linear  program.  By  a  straightforward  calculation 
from  the  definition  of  F*, 

(*.S)  (F*x*)(u*)  *  J<a,u*>  if  u*  >  0  and  A*u*  <  a0  -x*, 

-a>  if  not. 

Thus  maximizing  F*0  in  (P*)  is  the  same  as  maximizing 
<a,u*>  subject  to  u*  >  0  and  A*u*  <  a*,  the  ordinary 
dual  linear  program. 

The  dual  programs  of  Fenchel,  extended  by  the  present 
author  in  [43j,  may  also  be  represented  here  as  a  special 
case.  Again  let  A  be  a  linear  transformation  from  Rn 
to  Rm,  let  f  be  a  closed  proper  convex  function  on  Rn 
1 et  g  be  a  closed  proper  concsvt  function  on  Rm. 
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Define  F  by 

(3.9)  (Fu) (x)  *  i*(x)  ~g(Ax  4  a). 

Then  F  is  a  closed  proper  convex  bifunction,  and  (?) 
consis  "■>  of  minimizing  i(x)  -g^Ax)  in  x  s  Rn.  Note  that 
the  perturbation  u  here  corresponds  to  a  translation  of  the 
function  g  on  Rm.  By  elementary  calculation, 

(5.10)  (F*x*)(u*)  «  -f*(A*u*+x*), 

so  that  (P*)  consists  of  maximizing  g*(u*)  -f*(A*u*) 

in  u*  e  Rm,  Fenchel's  original  programs  are  the  case  where 
A  is  the  identity  transformation. 

For  the  classical  convex  program,  the  adjoint  bifunction 
is  given  by 

(F*x*)(u*)  «J-(f0+uJf1+...+U;  fm)*(x*)  if  uMu?,...,u*)  >  0, 
l-CD  if  U*  >  0. 

Thus  the  dual  program  (P*)  is  to  maximize  -(fQ  +  u£  f-^4. .  .+u£fm)*(0) 
subject  to  u£  >  0,  i  *  l,...,m.  To  calculate  the  conjugate 
of  f  »  fQ  +  u-J  f^  +  ...+  u*t  fm  explictly,  one  would  have 
to  know  more  about  the  given  functions  f.. .  However,  if 
every  f^  is  differentiable  one  can  apply  the  Legendre  trans¬ 
formation  in  the  weakened  form  of  (2.10)  to  f  to  get  a 
problem  which  is  "  almost"  equivalent  to  (P*).  Since 
-f*(  Vf(x))  -  f(x)  by  (2.10)  when  Vf(x)  =  0,  and 

Vf  -  Vf0  ♦  «i  Vr,  .....  u- vrm, 

the  "  approximate"  problem  is  that  of  minimizing 
fQ(x)  ♦  uj  fx(x)  ♦  uj  fm(x) 
in  u*  e  Rm  and  x  e  Rn  subject  to  the  constraints 

u*  >  0,  Vf0(x)  +  u£  Vfx(x)  +  ...+  u* Vfm(x)  *  0. 
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This  is  the  well-known  dual  problem  which  was  discovered  by 
Wolfe  [47.1. 

It  should  be  pointed  out  that  the  classical  convex 
program  can  be  modified  in  many  ways  by  introducing  add¬ 
itional  perturbations.  For  instance,  one  can  perturb  the 
constraint  f^(x)  <  u^  by  a  translation  y^  to  the  con¬ 
straint  f^(x-y^)  <  u^.  The  dual  problem  would  then  turn  out 
to  involve  an  additional  Lagrange  multiplier  vector  y?  dual 
to  the  perturbation  variable  y^  e  RnJ  this  would  essentially 
be  the  dual  problem  for  the  classical  convex  program  given 
by  the  author  in  [403.  The  possibilities  for  perturbation 
are  endless.  The  perturbations  can  be  chosen  to  suit  the 
situation,  according  to  what  "  equilibrium  prices"  one  is 
interested  in.  To  apply  the  duality  theory  described  here, 
it  is  only  necessary  that  the  perturbations  be  ' •  convex  ", 
in  the  sense  that  the  dependence  of  the  problem  on  the 
perturbations  be  representable  in  terms  of  a  convex 
bifunction  F. 

All  the  results  relating  the  general  dual  pair  of 
programs  (P)  and  (P*)  are  based  on  one  elementary  fact, 
which  follows  directly  from  the  definitions:  the  convex 
minimand  FO  in  (P)  is  the  conjugate  of  the  convex  function 
-sup  F*  on  Rr%  while  the  concave  maximand  F*0  in  (P*) 
is  the  conjugate  of  the  concave  function  -inf  F  on  Rm, 

This  implies  that 

(FO)*  =  (-sup  F*)**  -  -cl  (sup  F*), 

(F*0) *  *  (-  inf  F)**  *  -cl  (inf  F) , 
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and  hence  that 

(3.12)  cl ( sup  F*)(0)  -  -sup^<x,0>  -(F0)(x)^  =  (inf  F)(0), 

cl (inf  F) (0)  -  -infu,{  <0,u*>  -(F*0)(u*)^  *  (sup  F*)(0). 
The  infimum  (inf  P)(0)  in  (P)  is  thus  always  greater  than 
or  equal  to  the  supremum  (sup  F*)(0)  in  (P*),  and  any 
possible  discrepancy  between  these  extrema  is  completely 
explained  in  terms  of  the  closure  operations  for  convex 
and  concave  functions. 

Let  us  call  (P)  normal  if  cl(inf  F)(0)  *  (inf  F)(0). 

If  (P)  is  consistent,  this  is  equivalent  to  the  semicontinuity 
condition  that 

lim  inf  (inf  F)(u)  «  (inf  F)(0). 
u-»0 

Similarly,  let  us  call  (P*)  normal  if  cl(sup  F*)(0)  =  (sup  F*)(0) 
in  the  sense  of  the  closure  operation  for  concave  functions. 

Formulas  (3.12)  then  yield  a  good  duality  theorem:  (P)  is 
normal  if  and  only  if  (P*)  is  normal .  Moreover  the  normal 
case  is  precisely  the  one  where  the  extrema  in  (P)  and  (P*) 
are  equal,  i. e. 

(3.13)  (inf  F) (0)  »  (sup  F*)(0), 

For  brevity,  we  shall  say  that  normality  holds  when  both 
programs  are  normal  and  the  ’Inf"  and  "  sup"  are  equal. 

Normality  holds  in  particular,  then,  when  (P)  is  strictly 
consistent  (since  then  inf  F  is  continuous  at  0),  or  when 
a  Lagrange  multiplier  vector  exists  for  (P)  (since  then 
^ (inf  F) (0)  4  0,  implying  that  cl(inf  F)  agrees  with 
inf  F  at  0).  Likewise,  normality  holds  when  (P*)  is 
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strictly  consistent,  or  when  a  Lagrange  multiplier  vector 
exists  for  (P*). 

Suppose  that  normality  holds,  and  that  the  common 
extremum  value  in  (3*13)  is  finite.  As  we  have  already 
pointed  out,  u*  is  a  Lagrange  multiplier  vector  for  (P) 
if  and  only  if  u*  e  fc(-inf  F)(0).  Since  (-inf  F)*  -  F*0, 
this  is  equivalent  to  the  condition  that  0  c  ^(F*0)(u*), 
i.e.  that  the  concave  function  F*0  attain  its  maximum  at 
u*.  Similarly,  the  Lagrange  multiplier  vectors  x  for 
(P*)  are  the  vectors  where  the  convex  function  FO  attains 
its  minimum.  This  gives  us  another  duality  theorem: 
assuming  that  normality  holds,  the  Lagrange  multiplier 
vectors  u*  for  (P)  are  precisely  the  optimal  solutions 
(if  any)  to  (P*)»  while  the  optimal  solutions  x  to  (P) 
are  precisely  the  Lagrange  multiplier  vectors  for  (P*). 

This  type  of  duality  has  previously  been  known  only  in  the 
linear  programming  case. 

4.  Kuhn-Tucker  functions  and  minlmax  theroy. 

We  shall  now  describe  the  correspondence  between  convex 
bi functions  from  R®  to  Rn  and  concave-convex  functions  on 
Rm  X  Rn  which  is  analogous  to  the  correspondence  between 
linear  transformations  from  R®  to  Rn  and  bilinear  functions 
Rm  X  Rn.  This  correspondence  gives  further  insight  into  the 
nature  of  the  adjoint  bifunction.  It  enables  us  to  construct 
for  each  dual  pair  of  programs  (P)  and  (P*)  as  in  the  last 

section  a  certain  convex-concave  function  whose  saddle-points 
correspond  to  optimal  solutions  to  the  programs  much  as  in  the 


classical  Kuhn-Tucker  theory  [24]. 

Let  K  be  a  concave-convex  function  on  Rm  XRn,  i.e. 
a  function  with  values  in  L-od  ,  +od  ]  such  that  K(u,v)  is 
concave  in  u  for  each  v  and  convex  in  v  for  each  u. 
Closure  operations  may  be  applied  to  K  for  the  sake  of 
regularization.  Let  clyK  be  the  function  on  Rm  X  Rn 
obtained  by  closing  K(u,v)  as  a  convex  function  of  v  for 
each  u.  Similarly  let  cluK  denote  the  function  obtained 
by  closing  K  as  a  concave  function  of  u  for  each  v. 

Then  cluK  and  clyK  are  concave-convex  functions  on 
l^XH11  too. 

Ve  can  proceed  now  to  form  the  cor.c-v<-  '  -  ::v  ;x  functions 
cly  cluK  and  clu  clyK.  The  first  of  these  is  called  the 
lower  closure  of  K  (since  the  final  regularization  involves 
lower-semicontinuity),  and  the  second  is  called  the  upper 
closure  of  K.  If  K  coincides  with  its  lower  closure, 
it  is  said  to  be  lower-closed,  and  so  forth.  It  turns  out 
that  cly  cluK  is  itself  always  lower-closed,  and  clu  clyX 
is  upper-closed,  but  these  two  functions  may  disagree  at 
certain  points  of  Rm  XRn. 

Since  the  operations  cl„  cl  and  cl  cl  do  not 
quite  produce  the  same  result,  there  is  not  a  unique 
natural  closure  operation  for  concave-convex  functions. 
Nevertheless,  there  is  an  important  phenomenon  of  pairing  of 
closures.  It  may  be  shown  that,  if  K  is  any  lower-closed 
concave-convex  function  on  Rm  X  Rn,  then  K  *  c*uK  is  an 
upper-closed  concave-convex  function  such  that  clyK  »  K. 
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Thus  there  is  a  simple  one-to-one  correspondence  between 
the  lower-closed  functions  and  the  upper-closed  functions. 
Corresponding  functions  K  and  K  generally  have  the  same 
values,  except  at  certain  points,  and  K  < 


For  example,  let  C  and  D  be  closed  convex  sets  in 


Rm  and 

Rn,  respectively, 

and 

let 

K 

be 

any  continuous 

finite 

concave-convex  function 

defined 

on 

C  X  D.  Set 

(4.1) 

K(u,v) 

if 

U  E 

C 

and 

V  £  D, 

K(u,v)  +ao 

if 

U  £ 

C 

anu 

v  t  D, 

^-OD 

if 

u  t 

C. 

Tk(u,v) 

if 

U  £ 

c 

and 

V  £  D, 

K(u,v)  =  <  +00 

if 

v  t 

D 

u 

if 

u  t 

c 

and 

V  E  D. 

Then  K  and  K  are  lower-closed  and  upper-closed  concave-convex 
functions,  respectively,  which  are  paired  together  in  the  manner 
just  described.  Observe,  incidentally,  that 

sup  inf  K(u,v)  =  sup  inf  K(u,v)  =  sup  inf  K(u,v), 
u  v  “  u  v  ueC  veD 

inf  sup  K(u,v)  =*  inf  sup  K(u,v;  =  inf  sup  K(u,v). 
u  v  u  v  veD  ueC 

Thus  the  minimax  analysis  of  K  with  respect  to  C  X  D  can 

be  represented  by  the  formally  unconstrained  minimax  analysis 

of  K  or  of  K  (or  of  any  extension  of  K  to  all  of 

Rm  X  Rn  such  that  K  <  K  <  7) . 

In  order  to  apply  these  facts  to  the  study  of  bifunctions 
in  a  manner  su  :  estive  of  linear  alretra,  we  introduce  an 
inner  product  notation  for  the  conjugate  of  a  convex 
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(or  concave)  function  f: 

<f,x*>  =  <x*,f>  =  f*(x*). 

Then,  for  any  convex  bifunction  from  Rm  onto  Rn,  we 
can  form 

(4.2)  <Fu,x*>  =  <x* ,Fu>  =  (Fu)*(x*) 

as  a  function  of  u  e  Rm  and  x*  e  Rn.  Note  that,  if  F 
is  the  indicator  bifunction  of  a  linear  transformation 
A ?  Rm-‘Rn  as  in  (3.1),  then  <Fu,x*>  is  simply  the  bilinear 
function  <Au,x*>  associated  with  A. 

The  basic  theorem  is  the  following.  If  F  is  any 
closed  convex  bifunction  from  Rm  onto  Rn,  then  <Fu,x*> 
is  a  lower-closed  concave-convex  function  on  Rm  X  Rn. 
Conversely,  given  any  function  K  of  the  latter  type,  there 
exists  a  unique  closed  convex  bifunction  F  from  Rm  onto  Rn 
such  that  K(u,x*)  *  <Fu,x*>,  namely  the  F  given  by 
(Fu) (x)  =  SUPX*  {<x,x*>  -K(u,x*)|-  . 

The  upper-closed  X  on  Rm  X  Rn  paired  with  K  is  precisely 
the  concave-convex  function  associated  with  the  adjoint 
bifunction  F*,  i.e. 

R(u,x*)  ■  <u,F*x*>  =  (F*x*)*(u). 

Thus  the  formulas 

(4.3)  clu  <Fu,x*>  =  <u,F*x*>, 

<Fu,x*>  =  clx,<u,F*x*>, 

hold  for  any  closed  convex  bifunction  and  its  adjoint. 

Formulas  (4.3)  generalize  the  familiar  formula 
<Au,x*>  =  <u,A*x*> 

relating  a  linear  transformation  and  i',s  adjoint.  Since 
the  closure  operations  in  (4.3)  merely  redefine  the  functions 
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at  special  points,  one  will  actually  have 

(4.4)  <Fu,x*>  *  <u,F*x*> 
for  "most"  values  of  u  and  x*. 

Observe  that  (4.4)  expresses  a  duality  between  two 
different  extremum  problems,  because  by  definition 

(4.5)  <Fu,x*>  *  supx  ^<x,x*>  -(Fu)(x)^ 

<u,F*x*>  »  infu*^<u,u*>  -(F*x*) (u*)*^  . 

In  particular,  we  have 

(4.6)  -<Fu,0>  =  infx(Fu)(x)  =  (inf  F)(u), 

-<0,F*x*>  =  SUPU*  (F*x*)(u*)  =  (sup  F*)(x*). 

The  equality  of  the  extrema  in  the  programs  (P)  and  (P*) 
in  the  last  section  is  therefore  expressed  simply  by 
<F0 , 0>  =  <0,F*0>. 

Minimax  characterizations  of  duality  are  obtained 
through  the  introduction  of  inverse  bifunctions.  The 
inverse  of  a  convex  bifunction  F  from  Rm  onto  Rn  is 
the  concave  bi function  F*  from  Rn  to  Rm  defined  by 

(4.7)  (F,x)(u)  -  -(Fu)(x). 

If  F  is  closed,  F*  is  closed  too.  The  inverse  of  a 
concave  bifunction  is  defined  in  the  same  way.  It  is 
easily  seen  that  F,,  *  F  and  (F*),  *  (F.)*,  The  latter 
bifunction  from  Rm  to  Rn  will  be  denoted  simply  by  FJ. 

As  an  example,  if  F  is  the  convex  indicator  bifunction 
of  a  non-singular  linear  transformation  A  as  in  (5.2),  then 
F,  is  the  concave  indicator  bifunction  of  A-1,  i.e.  (F,x)(u) 
is  0  if  u  *  A~^x  and  -m  if  u  /  A_1x.  Likewise,  F; 


is  the  convex  indicator  bi function  of 

Given  any  closed  proper  convex  bifunction  F  from 
Rm  to  Rn,  we  define  the  Kuhn-Tucker  function  of  the 
corresponding  program  (P)  to  be  <u*,F,x>  as  a  function 
of  u*  and  x.  Since  F,x  is  concave,  we  have  by  definition 

(4.8)  <u*,F,x>  ■  infu^<u,u*>  -(F*^(u)^ 

*  infu^<u,u*>  ♦(Fu)(xj}  . 

This  is,  of  course,  an  upper-closed  concave-convex  function 
on  R®  XRn  by  the  correspondence  theory  already  outlined. 

In  the  case  of  a  classical  convex  program,  where  F  is 
given  by  (3=1) »  the  Kuhn-Tucker  function  is  evidently  given  by 

(4.9)  <u*,F,x>  » /fQ(x)  +  uj  f^(x)  ♦  ...+  u*  fm(x)  if  u*=(uj,.. 

(.-OD  if  u*  >  0. 

Except  for  the  convenient  concave  extension  by  means  of  -ao  , 
this  is  the  function  associated  with  (P)  by  the  familiar 
Kuhn-Tucker  theory. 

In  the  case  where  F  is  given  by  (3.9),  the  Kuhn-'i'ucker 
function  is  given  by 

(4.10)  <u*,F.x>  -  f(x)  ♦  g*(u*)  -<Ax,u*>, 
with  -a>  +CD  taken  to  be  +cd  , 

A  saddle-point  of  the  Kuhn-Tucker  function  is,  of  course, 
a  vector  pair  (u*,x)  such  that 

(4.11)  <u*',F.x>  <  <u*  ,F,x>  <  <u  •  ,F,x  '> ,  \/u* ' ,  Vx ' . 

The  main  result  is  this:  a  vector  pair  (u*,x)  is  a 
saddle-point  of  the  Kuhn-Tucker  function  of  (p)  if  and 
only  if  u*  is  a  Lagrange  multiplier  vector  for  (?)  and 
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x  is  an  optimal  solution  to  (P).  In  this  event  normality 
holds,  and  the  minimax  value  <u*,F,x>  coincides  with  the 
infimum  in  (P)  and  the  supremum  in  (P*).  Moreover,  as 
explained  in  the  last  section,  u*  is  then  dually  an  optimal 
solution  to  (P*),  and  x  is  a  Lagrange  multiplier  vector 
for  (P*). 

Given  any  upper-closed  concave-convex  function 
K  on  Rm  X  Rn  (for  instance  a  K  of  the  type  in  (4.1)), 
there  is,  as  we  know,  a  unique  closed  concave  bifunction 
G  from  Rn  onto  Rm  sue*,  that  !r(u*,x)  *  <u*,Gx>.  Hence 
there  i3  a  unique  program  (P)  having  R  as  its  Kuhn-Tucker 
function,  namely  the  (P)  corresponding  to  F  «  G,.  The 
inverse  operation  for  bifunctions  tnerefore  corresponds  to 
a  general  minimax  theory  for  concave-convex  functions  in  the 
same  way  that  the  adjoint  operation  for  bifunctions  corresponds 
to  a  general  duality  theory  for  convex  programs.  It  is  clear 
from  the  definitions  that  the  F  and  F*  here  are  expressible 
in  terms  of  K  by 

(4.12)  (Fu) (x)  -  supu.  |  K(u*,x)  -<u,u*>j  , 

(F*x*)(u*)  -  infx\ff(u*,x)  -<x,x*>}  . 

In  particular,  the  minimand  in  (P)  is  given  by 
(FG) (x)  -  supu,K(u* ,x) , 
and  the  maximand  in  (?•)  ia  given  by 
(F*0) (u*)  -  infx  K(u;x). 

The  dual  programs  of  Dantzig,  Eisenbcrg  and  Cottle  [6], 

[•«] 

Stoer  14^) f  Mangasarian  and  Ponr-tein  [?b,,  Falk  and  Thrall  may  be 

a 

obtained  in  this  way,  for  instance  by  applying  the  Legendre 
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transfomation  formula  (?.lu)  to  (4.1?)  and  similar  devices. 

The  pair  of  functions  <FJu;x>,  <uJF,x>,  is  conjugate 
to  the  pair  of  functions  <Fu,x*>,  <u,F*x*>,  in  the  following 
sense.  If  K  is  any  one  of  the  concave-convex  functions 

such  that 


(4.13)  <Fu,x*>  <  K(u,x*)  <  <u,F*x*> 

(such  functions  all  being  essentially  the  same,  except  at 
special  points),  one  has,  according  to  the  definitions, 


(4.14) 

infu 

< 

SUPx. 4 

[<-. 

ti*> 

♦ 

<X, 

x*> 

- 

K(u,x* 

eupx. 

infu1 

[<u, 

u*> 

+ 

<X, 

x*> 

- 

K(u , x* 

j> 

=  <F^u* ,x>. 

On  the  other  hand,  if 

K* 

i 

s 

any 

one 

of 

t  u  °  t 

unctions 

satisfying 

(*.15) 

<FJu* , 

x>  < 

K* 

(u 

\x) 

<  < 

u* 

♦?.*>< 

one  has  in  turn 

(4.16) 

infu* 

8UPx 

|<U, 

u*> 

♦ 

<x, 

x*> 

- 

K* (u  * , 

x) 

i 

1  -  <u ,F*x*>, 

8UPX 

inV  J 

K 

u*> 

♦ 

<X, 

x*> 

- 

K*(u* , 

x) 

}  *  <Fu ,  x*>. 

Applying 

(4.3) 

to  the 

con 

vex 

bifun 

of.  ion 

f:  i 

place  of 

F,  we  have 

(4.17) 

cl  ,< 

u* 

FJu* 

,x> 

■ 

<u* 

v  X 

1 

<f:u* 

,x> 

»  c 

i 

*  X 

<u*. 

r\  „  'v 
' 

1 

and  this 

makes 

possible  a 

de 

ta 

i  led 

co  : 

r  h 

n  son 

c  f 

the  *'  inf  sup" 

in  (4.14) 

.  In 

particular 

wc 

e  e  u 

hat 

th 

ese  tw 

0 

extrema  are 

"  usually"  e^ualj  the  fact  that  tn ey  can  b*  different  in  some 
cases  1  s  exactly  dual  to  the  fact  tr.«t  th-  y  per  and  lower 
closure  operations  for  cor.cave-co:  vex  functions  do  not  always 
coincide.  A  tsinimux  theory  from  t.is  point  of  view  was 
developed  by  the  author  in  13*0. 
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APPLICATIONS  OF  CONTROL  THEORY  TO  ECONOMIC  GROWTH 
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Lecture  1 

Review  of  the  Basic  Theorems  of  Optimal  Control  Theory;  Finite  Horizon 

The  basic  criteria  for  optimization  of  dynamic  processes  in  continuous 
times,  as  stated  by  L.  S.  Pontryagin  and  associates  [1962]  will  be  restated 
in  this  and  the  following  lecture.  Some  emphasis  will  be  placed  on  special 
features  appropriate  to  the  use  that  will  be  made  of  these  theorems  in  growth 
theory,  in  particular  the  assumption  of  an  infinite  horizon  and  the  presence 
of  constraints  on  the  choice  of  control  variables. 

The  object  of  study  is  a  system,  economic  or  other,  evolving  in  time. 

At  any  moment,  the  system  is  in  some  state,  which  can  be  described  by  a 
finite-dimensional  vector  x(t).  For  an  economic  system,  the  amount  of 
capital  goods  of  each  type  might  constitute  a  suitable  state  description. 

In  an  optimization  problem  there  is  some  possibility  of  controlling 
the  system.  At  any  time,  t,  there  is  a  vector  v(t)  which  can  be  chosen 
by  a  decision-maker  from  some  set  which  may,  in  general,  vary  with  both  t 
and  the  state  x(t).  The  vector  v(t)  is  frequently  referred  to  as  the 
decision  or  control  variable;  following  the  terminology  of  Tinbergen  [1952, 
p.  7],  the  term  Instrument  is  used  here.  In  an  economic  system  the  in¬ 
struments  are  typically  the  allocations  of  resources  to  different  productive 
uses  and  to  consumption  or  perhaps  taxes  and  bond  issues  which  at  least 
partially  determine  allocations. 

It  is  assumed  that  the  state  and  the  instrument  variables  at  any  point 
of  time  completely  determine  the  rate  of  change  of  the  state  of  the  system. 
Thus,  for  a  given  technology  and  labor  force,  the  capital  structure  (state) 


together  with  its  allocation  among  different  uses  (by  some  of  the  in¬ 
struments)  determine  the  outputs  of  all  goods.  These  in  turn  are  al¬ 
located  between  consumption  and  capital  accumulation  (through  other  compo¬ 
nents  of  the  instrument  vector).  In  symbols,  the  evolution  of  the  state 
of  the  system  is  governed  by  the  differential  equations 

(1)  x  =  T[x( t),  v( u),  t], 

which  will  be  referred  to  as  transition  equations.  The  time  variable  t 
may  enter  into  T  to  allow  for  the  possibility  that  the  transition  rela- 
tioris  may  vary  over  time  due  to  technological  progress,  labor  force  growth 
or  other  exogenous  factors. 

Given,  then,  the  state  ot  the  system  at  some  time,  say  0,  and  the 
choice  of  instruments  as  a  function  of  time,  v(c),  the  whole  course  of 
the  system  is  determined.  To  begin  witn,  let  us  suppose  that  the  analysis 
is  carried  out  only  until  a  finite  horizon  T,  after  which  the  process  ceases- 

By  suitable  choices  of  the  values  of  instruments  over  time,  alternative 
histories  of  the  process  can  be  achieved.  Ac  usunl  in  economic  analysis, 
we  assume  chat  these  histories  can  be  valued  in  some  way,  i.e.,  we  can 
express  preferences  as  between  alternative  histories,  and  these  preferences 
car.  be  given  numerical  value,  a  utility  functional  with  arguments  x(t), 
v( c),  (0  <  t  <  T).  fhe  optimisation  problem  is  to  choose  the  values  of 
tne  instrument  variables  so  as  to  maximize  tne  utility  functional  subject 
to  the  constraints  implied  by  (l),  the  constraints  on  the  choices  of  the 
instruments,  and  the  initial  values  of  the  state  variables. 

More  specifically,  it  will  be  assumed  tnat  the  utility  functional  is 
additive  over  time.  That  is,  at  each  moment  t  there  1b  a  return  or 
felicity  (to  use  a  term  due  to  Gorman  [1957*  P*  *»33)  which  depends  only  on 


the  values  of  the  state  variables  and  instruments  at  time  t,  such  that 
the  utility  of  a  whole  history  is  the  sum  or  integral  of  the  values  of  the 
felicities  at  each  moment  of  time.  Let 

(2)  U(x,v,t)  =  felicity  at  time  t  if  the  state  is  x  and  the 
instrument  vector  is  v. 

In  additition  to  the  felicity  generated  at  each  moment  of  time  during 

the  process,  the  decisicn-maker  may  also  assign  a  value  to  the  state 

achieved  at  the  end  of  the  process,  T.  In  an  industrial  application 

* 

the  stock  of  machines  may  have  a  scrap  ^alue ,  and  we  will  use  this  term 
generally.  In  a  broader  economic  context,  if  T  is  not  literally  the 
end  of  the  world  but  only  the  end  of  the  planning  period,  the  capital 
stock  left  over  at  T  will  have  some  use  in  the  future.  The  scrap  value 
will  be  denoted  by  S[x(t)1. 

The  general  form  of  ohe  optimization  problem  in  time  is  then,  with 
finite  horizon, 

(3)  iMaximize  U[x(t),  v(t),t)dt  +  S[x(T)l  vith  respect  to  choice  of  the 
instruments  over  time  subject  to  (l),  some  constraints  on  the  choice 

of  instruments  possibly  depending  on  the  current  values  of  the  state 
variables,  and  the  initial  conditions  x(0)  =  x. 

Then  Pontryagin  and  associates  [1962]  (see  also  Halkin  [1964])  have  shown 

Proposition  1 .  Let  v*(t)  be  a  choice  of  instruments  (0  <  t  <  T) 
which  maximizes  (3)*  Then  there  exist  auxiliary  variables,  functions  of 
time,  p(t),  with  the  same  dimensionality  as  the  state  x,  such  that, 
for  each  t, 

(a)  v*(t)  maximizes  H[x(t),  v(t),  p(t),t),  where 

H(x,v,p,t)  -  U(x,v.t)  +  pT(>.,v  ,t); 
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the  function  p( t)  satisfies  the  differential  equations 

(b)  p^  *  -dH/dx^,  evaluated  at  x  =  x(t),  v  =  v*(t), 

P  =  P(t); 

and  the  transversal! ty  conditions 

(c)  p^(T)  =  ds/dx^,  evaluated  at  x  =  x(T), 

hold. 

The  function  H  is  known  as  the  Hamiltonian.  The  auxiliary 
variables  p  can  be  given  an  economic  interpretation"  Consider  the 
maximization  of  the  utility  function  from  any  time  t  to  the  horizon  T; 
the  past  history ,  before  t  ,  affects  this  problem  only  through  the  state 
at  time  tQ,  as  can  easily  be  seen  from  (1)  and  the  additive  nature  of 
the  maximand  in  (3).  Let  this  maximum  be 

(U)  V(x,tQ)  =  max  U[x(t),  v(t),  t]dt  -  Six(T)]},  where  xkt_)  =  x 

Then  the  auxiliary  variables  are  defined  so  that 

(5)  P1  =  • 

An  auxiliary  variable  measures  the  marginal  contribution  of  the  correspond' 
lng  state  variable  to  the  utility  functional  at  time  t  .  Then  p.x, 

*  p^T^  is  the  rate  of  increase  of  utility  dut  to  the  current  rate  of 
increase  of  the  state  variable  x^ ,  and  therefore  H  Is  tne  current 
flow  of  utility  from  all  sources,  both  enjoyed  immediately,  a:-  expressed 
by  U,  and  m  ti  ed  pa ted  to  be  enjoyed  in  tne  future,  as  expressed  by  pT. 
The  current  instruments  are  chosen  then  to  maximize  H.  The  condition  (b) 
i 8  an  equilibrium  condition  for  holding  the  state  variables  constant  (at 
an  instant  of  time);  the  increment  in  utility  plus  speculative  gain  should 

be  zero;  if  not,  the  individual  wcuid  have  wanted  to  have  less  or  ax  re  >  f 
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that  scate  variable  (read,  stock  of  a  capital  good  in  the  economic  context). 
Finally,  at  time  T,  V(x,T)  =  S(x);  hence  (c)  holds  by  (5). 

In  the  sequel,  a  slightly  different  formulation  of  end-of-period 
conditions  viil  be  useful.  Instead  of  a  scrap  value,  require  simply  that 
the  end-of-period  values  of  the  state  variables  be  non-negative.  Now 
approximate  this  condition  by  a  scrap  value  function;  that  is.  permit 
negative  values  but  impose  a  very  large  penalty.  Formally,  let 

S(x)  £  P  min  (x  .0), 

.i.,1  1  1 

where  min  (x^.o)  means  the  smaller  of  and  0,  and  the  P^'s  are 

chosen  very  large  For  Xj^  <0,  ;  for  xi  >  0,  =  0. 

If  x.,  -  0,  the  right-hand  derivative  is  zero  and  the  left-hand  derivative 

■a. 

f  a  fact  which  may  be  expressed  loosely  by  the  statement  0  <  ds/dxi<  P^. 
Now  let  the  Pi *s  approach  r  ®,  so  that  we  may  be  sure  that  the 
optimal  policy  will  never  lead  to  a  final  negative  value,  x^T): 

x(T)  >  0. 

From  the  preceding  discussion,  ds/^Xj  >  0,  and  farther  if  Xj(T)  >.0, 
then  c^S/cto^  =  0.  In  view  of  Proposition  1(c),  p,  (T)  >  0,  p^(T)  x^T)  «=  0, 
all  i ,  or 

p(T)  >  0,  p(T)  x(T)  *  0. 

Proposition  2.  Let  v*( t )  be  a  choice  of  instruments  (0  <  t  <  T) 
which  maximizes  U[x(t),v( t) ,t Jdt  subject  to  the  conditions 
(a)  x  Tfx(t),  v(t),tj. 

some  constraints  on  the  choice?  of  instr-c  ?nts  possibly  involving  current 
values  of  the  state  variables,  and  the  terminal  conditions,  x(T)  >  0.  Then 
there  exist  auxiliary  -ari  rfb.Lv?  .  v(t).  thv.t.  (a)  ind  (b)  of  Proposi  lion  1 


hold  and  for  which 

(b)  p(T)  >  0,  p( T)  x(t)  »  0. 

Hie  optimal  path  is  the  solution  of  the  differential  equations  (l)  and 
Proposition  l(b)j  the  values  of  the  instruments  which  enter  into  them  are 
determined  as  functions  of  x,  p,  and  t  by  Proposition  1(a).  The  number 
of  these  equations  is  twice  that  of  the  number  of  state  variables.  The 
solution  is  usually  only  determined  when  an  equal  number  of  initial  condi¬ 
tions  are  specified.  Hie  values  of  the  state  variables  at  the  beginning 
of  the  process,  x(0),  are  taken  as  known,  but  these  constitute  only  half 
the  needed  conditions.  The  transversality  conditions,  Proposition  l(c)  or 
Proposition  2(b),  constitute  the  remaining  conditions,  but  from  a  practical 
point  of  view  they  suffer  from  the  severe  difficulty  of  being  defined  at 
the  end  of  the  process,  while  the  other  initial  conditions  are  defined  at 
the  beginning.  The  computation  can  proceed  by  guessing  initial  values, 
p(0),  solving  the  system  of  transition  and  auxiliary  equations  with  the 
hope  that  the  transversality  conditions  are  satisfied,  and  correcting  the 
initial  guesses  if  not.  It  can  also  proceed  by  guessing  the  final  state 

x(T)  and  solving  the  equations  backward  in  the  hope  that  the  initial  condi- 

♦ 

tions  are  satisfied. 

Now  consider  more  explicitly  the  constraints  on  the  instruments.  In 
general,  they  may  depend  on  the  values  of  the  state  variables.  Thus, 
amounts  of  resources  allocated  to  particular  productive  purposes  are 
constrained  by  the  total  amounts  available,  which  in  turn  are  determined 
by  the  state  variables.  The  following  discussion  is  based  on  that  in 
Pontryagin  [1962,  Chapter  VI]  and  on  the  theory  of  r.cnlinear  programming 
due  to  Kuhn  and  Tucker  [1951]- 


Let  the  chc;  ct-  of  instrumec  at  any  time  t  with  state  x  satisfy 
a  vector  of  inequality  constraints, 

(6)  F(x,v,t)  >  0. 

For  example,  if  outpu*  is  a  function  of  the  stock  of  capital,  F(K),  and  if 
output  is  to  be  divided  between  consumption  (c)  and  investment  (l),  then 
the  instruments  C  and  i  satisfy  the  condition, 

F(K)  -  C  -  I  >  0, 

which  involves  the  state  variable  K.  Some  of  the  constraints  in  F 
might  not  include  state  variables;  for  example,  non-negativity  conditions 
on  the  instruments. 

It  is  veil  x- own  from  the  general  theory  of  n  Linear  programming 
that  if  v*  maximizes  H  subject  tc  the  conditions  (6),  and  if  these 
constraints  satisfy  a  certain  condition  known  as  the  Constraint  Qualifica¬ 
tion  (see  Kuhn  and  Tucker,  [ly%,  pp.  J»83-4J;  Arrow,  Hurjioz,  and  Uzawa 

[15-Si}},  then  at  any  sscasent  -of  time  there  exist  multipliers  q°  such  that 

(7)  1  >  c,  i  f'  * ,v*.t )  .  0, 
and 

(3)  v  v>-  %  -  i  ■ 

where 

(9)  L  „  H  ♦  qF* 

It  can  h*  '.ftcvn  that  i'i.d’x,  -  c>L  six,  when  evaluated  at  v  .  v», 

*  * 

o 

q  «  q  . 

With  the  explicit  formulation  (6)  of  constraints,  the  conditions 
for  optimization  over  time  breoese 
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Proposition  3.  Let  v*(t)  be  a  choice  of  instruments  (0  <  t  <  T) 
which  maximizes  U[x(t),v(t),t3dt  subject  to  the  conditions, 

(a)  x  =»  T[x(t),v(t),t], 
a  set  of  constraints, 

(b)  F(x(t),v(t),t]  >  0. 

on  the  instruments  possibly  involving  the  state  variables,  initial  condi¬ 
tions  on  the  state  variables,  and  the  terminal  conditions  x(?)  >  0-  If 
the  Constraint  Qualification  holds,  then  there  exist  auxiliary  variables 
P(t),  such  that,  for  each  t, 

(c)  v*(t)  maximizes  Hfx(t) ,v.p(t),t]  subject  to  the 
constraints  (b),  where  H(x  ,v,p,t)  -  U(x,v.t)  +  pT(x v,t), 

(d)  evaluated  at  x  «  x(t),  v  v*(t),  p  -  p(t), 

where 

(e)  L(x,v,q,t)  =  -H(x,v,p,t)  +  qF(x,v,t), 
and  the  Lagrange  multipliers  q  are  such  that 

(f)  dL/dvk  *  0,  for  x  =  x(t);  v  —  v*(t),  p  ==  p(t), 

Q  >  0,  qF [ x(  t ) ,  v*(  t ) , t ]  =  0, 

and 

(g)  P(T)  >  0,  p(T)  x(T)  =  0. 

In  many  circumstances  it  is  reasonable  to  consider  in  addition 
restrictions  on  the  state  variables  in  which  the  instruments  do  not  enter. 

In  particular,  if  the  state  variables  are  stocks  of  capital,  negative  values 
have  no  meening.  Here,  non-negativity  conditions  on  the  state  variables, 

(10)  x( t)  >  0, 

will  be  considered;  the  terminal  condition  x(T)  >  0  is  implied. 
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For  any  i,  if  x^(t)  >  0,  then  the  corresponding  constraint  (10)  is 

ineffective  and  can  be  disregarded.  Suppose  that  x^(t)  =  0  over  seme 

interval.  Then,  to  avoid  violation  of  (10),  the  instruments  must  be  so 

constrained  that  x^(t)  >  0,  and  this  constraint  is  clearly  effective 

over  that  interval.  But  x^  =  so  that  the  constraint  T^(x,v,t)  >  0 

is  effective  over  that  interval.  Then,  in  Proposition  3,  this  constraint 

can  be  regarded  as  added  to  the  original  set  of  constraints  (b).  Let  q. 

be  the  Lagrange  multipliers  associated  with  the  original  constraints  (b), 

and  let  r.  be  the  multiplier  associated  with  the  new  constraint  T.  >  0. 

i  is 

As  before,  r^  >  0.  Define,  in  addition,  r^  =  0  for  each  state  variable 
for  which  xi(t)  >  0.  Then  clearly  r  >  0,  rT  =  0,  rx  =  0. 

Proposition  4.  Let  v*(t)  be  a  choice  of  instruments  (0  <  t  <  T) 
which  maximizes  J*  U[x(t),v(t),t]dt  subject  to  the  conditions, 

(a)  x  -  T[x(t),v(t),t], 
a  set  of  constraints, 

(b)  F[x(t),v(t),t]  >  0, 

involving  the  instruments  and  possibly  the  state  variables,  initial  condi¬ 
tions  on  the  state  variables,  and  the  non-negativity  conditions, 

(c)  x(t)  >  0, 

on  the  state  variables.  If  the  Constraint  Qualification  holds,  then  there 
exist  auxiliary  variables  p(t)  such  that,  for  each  t, 

(d)  v*(t)  maximizes  H(x(t),v,p(t),tJ  subject  to  the 
constraints  (b)  and  the  additional  constraints  T^[x(t),v,t]  >  0  for  all 
i  for  which  x±( t)  ■  0,  where  H(x,v,p,t)  -  U(x,v,t)  ♦  pT(x,v,t ); 

(e)  p  a  -dL/dx^,  evaluated  at  x  »  x(t),  v  »  v*(t),  p  ■ 

P(t),  *  -  q(t),  r  >  r(t),  where 
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f 


(f)  L(x,v,p,q,r,t)  =  H(x,v,p,t)  +  qF(x,v,t)  +  rT(x,v,t), 

-4  > 

and  the  Lagrange  multipliers  q  and  r  are  such  that 

(g)  ^L/^vk  =  0,  for  x  =  x(t),  v  •  v*(t),  p  ^  p(t), 
q(t)  >  0,  q(t)  F[x(t),  v*(t),t]  =  0, 

r(t)  >  0,.  r(t)  x(t)  -  0,  r(t)  T[x(t),v*(t),t]  =  0; 

(h)  p(T'  >  0,  p(T)  x(T)  =  0. 

So  far  the  propositions  stated  have  been  necessary  conditions  for  the 
optimality  of  a  policy.  The  situation  is  precisely  analogous  to  the  usual 
problem  in  calculus;  the  condition  that  a  derivative  be  zero  is  necessary 
for  a  maximum  but  certainly  not  sufficient  in  general.  However,  the 
condition  is  sufficient  if  the  function  being  maximized  is  concave.  A 
basic  property  of  concave  functions  is  the  following: 

(11)  If  f(x)  is  a  concave  function,  then  for  any  given  point  x* 
and  any  other  point  x  in  the  domain  of  definition,  f(x)  < 
f(x*)  +  f*(x-x*),  where  f*  is  the  row  vector  with  components 
df/dx^  evaluated  at  x*. 

Define  the  function 

(12)  H°(x,p.t)  n  max  H(x,v,p,t),  wh?re  v  is  constrained  as  in 

v 

any  cf  the  Propositions  1-U. 

Then  the  concavity  of  H°  as  a  function  of  x,  for  given  p  and  t, 
implies  that  the  Fontry&gin  conditions  are  sufficient  for  optimality. 

(This  is  a  minor  variation  of  a  theorem  of  Mangasarian  [1966].) 

Proposition  5.  If  H°,  as  defined  in  (12),  is  concave  in  x  for 
given  p  and  t,  then  any  choice  of  instruments  satisfying  the  conditions 
of  any  of  Propositions  1-4  is  optimal  for  the  corresponding  problem. 
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Lecture  2 


Optimization  with  Infinite  Horizon 

For  many  purposes  it  is  more  convenient  to  introduce  the  fiction  that 
the  horizon  is  infinite.  Certainly  processes  of  capital  accumulation  for 
the  economy  as  a  whole  have  no  natural  stepping  place  in  the  definable 
future.  At  any  given  future  date  the  state  of  the  system  (its  capital 
structure)  will  have  implications  for  the  further  future.  If  we  choose 
to  stop  our  analysis  at  any  fixed  date,  it  will  be  necessary,  as  already 
noted,  to  include  in  cur  utility  functional  some  scrap  value  for  the  stock 
of  capital  at  the  end  of  the  period.  But  the  only  logically  consistent 
way  of  doing  so  is  to  determine  the  maximum  utility  attainable  in  the 
further  future  starting  with  any  given  stock  of  capital.  Of  course,  the 
astronomers  assure  us  tnat  the  world  as  we  know  it  will  come  to  an  end 
in  some  few  billions  of  years.  But  as  elsewhere  in  mathematical  ap¬ 
proximations  to  the  real  world,  it  is  frequently  more  convenient  and  more 
revealing  to  proceed  tc  the  limit  to  make  a  mathematical  infinity  in  the 
model  correspond  to  the  vast  futurity  of  the  real  world. 

Formally,  the  only  change  in  the  statement  of  the  model  is  to  let 
I»  +  •■  But  going  to-  the  limit,  here  as  elsewhere,  involves  acme  risk*. 
The  utility  functional,  now  an  improper  integral,  might  not  converge  at 
all;  and  even  if  It  does,  there  might  not  exist  an  optimal  policy.  How¬ 
ever,  it  is  still  possible  to  state  necessary  conditions  and  sufficient 
conditions  for  optimality,  though  existence  of  an  optimal  policy  may  be 
difficult  to  guarantee,  and  also  it  is  not  yet  known  how  to  state  the  ap¬ 
propriate  transversal! ty  conditions.  An  extensive  discussion  of  a  case  of 
non-existence  of  an  optimal  path  is  given  by  Koopmans  (1965*  PP»  251*31* 

i . 


If  an  optimal  policy  exists,  then  it  can  be  shown  that  the  arguments 
for  the  necessity  conditions  of  Propositions  1-4,  except  for  the  trans- 
versality  conditions,  are  still  valid.  In  the  cases  of  interest  in 
economics,  the  transversality  conditions  (Propositions  1(c),  2(b),  3(g)> 
or  4(g)  are  in  fact  valid,  but  so  far  it  is  necessary  to  verify  this  in 
each  case.  The  infinite-horizon  statement  of  the  transversality  condi¬ 
tions  of  Propositions  2-4  is: 

(13)  lim  p(t)  >  0,  lim  p(t)  x(t)  =  0. 

t  -♦  +®  “  t  -*  +“ 

The  sufficiency  theorems  remain  completely  valid,  with  the  trans¬ 
versality  condition  (13). 

It  is  customary  and  reasonable  to  assume  that  future  felicities  are 
discounted;  i.e.,  the  felicity  obtained  at  time  t  is  multiplied  by  a 
discount  factor  a(t),  which  is  ordinarily  taken  to  be  a  decreasing 
function  of  t.  This  corresponds  to  the  intuitive  idea  that  future  pleas¬ 
ures  are  counted  for  less  today.  The  utility  functional  is  rewritten: 

^4)  J***  a(t)  U[x(t),v(t),t]dt. 

Ordinarily,  it  is  assumed  that  if  the  chosen  policy  leads  to  a 
constant  felicity,  (14)  will  converge.  This  is  equivalent  to  the  condi¬ 
tion, 

(15)  0(t)dt  converges. 

If  we  follow  the  earlier  line  of  argument  we  would  be  interested  in 
the  maximum  utility  obtainable  starting  at  some  time  tQ,  analogous  to 
(4): 

(16)  V(x,tQ)  .  max  0<t)  U[x(t),v(  t),t]dt,  where  x(tQ)  «  x. 
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However,  this  means  that  felicities  for  times  beyond  t  are  being  dis¬ 
counted  to  time  0.  It  is  more  natural  to  discount  them  to  time  t  . 

o 

Since  one  unit  of  felicity  at  time  tQ  is  equivalent  to  oe( tQ)  units  at 
time  0,  it  is  necessary  to  divide  V(x,tQ)  by  a(to)  to  obtain  the 
current-value  return  function, 

(17)  W(x,to)  =  V(x,tQ)/a(to). 

Previously  we  obtained  the  auxiliary  variables,  p(t),  as  the  marginal 
contributions  of  the  state  variables  to  the  utility  functional,  ■ 
dv/cbc  .  In  the  present  context  it  seems  more  reasonable  to  define 

(10)  Pj^  a  dw/dxj,  =  (dv/dx^/a  . 

In  applying  Proposition  4  (apart  from  the  transversality  condition) 
to  the  discounted  infinite -horizon  case,  it  is  then  necessary  to  replace 
U(x,v,t)  by  a(t)  U(x,v,t)  and  p(t)  by  a(t)  p(t).  The  Hamiltonian 
becomes 

(19)  a(t)  u(x,v,t)  +  a(t)  p(t)  T(x,v,t)  -  a(t)  H(x,v,p,t), 
where  we  now  define  the  current-value  Hamiltonian: 

(20)  H(x,v,p,t)  a  U(x,v,t)  +  pT(x,v,t)  . 

Then  a(t)  H  must  replace  H  throughout  the  restatement  of  Proposition  4. 

Since  a(t)  >  0,  the  choice  of  Instruments  to  maximize  a(t)  H 
is  the  same  as  that  to  maximize  H,  so  that  Proposition  4(d)  remains 
unchanged.  If  we  interpret  the  Lagrange  multipliers  q,  and  r  as 
referring  to  the  maximization  of  H  as  now  defined  subject  to  the 
constraints,  then  L  must  be  replaced  by  Q^t)  L.  Proposition  4(e)  be¬ 
comes 


d(o<t)piJ  d[a(t)L] 


ij 
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c*  p^  +  a  Pi  *  -  a(dL/^  )  . 


Id  vide  through  by  a,  and  define 
(19)  p(t)  a  -Q<t)/a(t)  . 


Then 

(20)  PA  .  p(t)pt  -  . 


In  economic  terms,  p(t)  is  a  short-term  interest  rate,  correspond¬ 
ing  to  the  system  of  discount  factors  a( t).  He  definition  (19)  can  be 
integrated  back  to  yield  the  familiar  fora: 

-Jo  <*u)du 


(21) 


Q(t) 


if  we  adopt  the  convention  that  a(0)  1 .  If  (2 C)  is  written 

Pt  +  «  p(t)  Pt(t), 


It  is  the  familiar  equilibrium  relation  for  investment  in  capital  goods; 
the  sum  of  capital  gains  and  marginal  productivity  should  equal  the  interest 
on  the  investment. 

The  infinite-horizon  analogue  of  Proposition  *  (apart  from  transversal - 
lty  conditions)  becomes: 


Propositi  on  6.  Let  v*(t)  be  a  choice  of  instrument?  (t  >0)  which 
a(t)  U(x(t),v(t),t]dt  subject  tc  the  conditions  (a),  (b),  and  (c)  of 
Proposition  k.  If  the  Constraint  Qualification  holds,  then  there  exist 
auxiliary  variables  p(t)  satisfying  (d)  of  Preposition 

(•)  ?i  •  fPj  *  (&/dxt)/  evaluated  ai  x  *  *{t),  v  •  v*(t),  p  • 
p(t).  q  ■«  q(t),  r  *  r(t). 

where  p(t)  *  -a(t)/a(t),  and  (f)  and  ($)  of  Prcp^^i t’.or.  *  held 


The  sufficiency  theorem,  Proposition  5>  remains  valid  if  the  trans- 
versality  condition  is  replaced  by  (13)  where,  however,  p(t)  is  replaced  by 


Q(t)  p(t). 

Proposition  7*  In  the  notation  of  Propositions  and  6,  if 

H°(x,p,t)  =  max  H(x,v,p,t), 
v 

where  the  maximization  is  over  the  range  specified  in  Proposition  4(d), 
is  a  concave  function  of  x  for  given  p  and  t,  then  any  policy 
satisfying  the  conditions  of  Proposition  6  and  the  transversality 
conditions, 

lim  a(t)  p(t)  >  0,  lim  a(t)  p(t)  x(t)  ■  0, 
t  -♦  +•  “  t  -»  -h» 

is  optimal. 

It  is  frequently  appropriate  to  make  an  assumption  that  the  basic 
conditions  of  the  optimization  problem  are  stationary;  the  sequence  of 
conditions  to  be  encountered  in  the  future  is  much  the  sane  as  today  or 
can  be  made  so  after  same  simple  renormalizations.  This  property  will  be 
heavily  exploited  in  our  subsequent  discussions.  The  basic  stationarity 
assumptions  are  that  the  functions  U(x,v,t),  T(x,v,t),  P(x,v,t),  and  p(t) 
are  all  Independent  of  time.  With  p  constant,  it  follows  from  (21)  that 

(a)  *0  .  «-pt. 

and  the  convergence  condition  (15)  becomes 
(23)  P  >  0. 

Under  the  stationarity  assumption,  the  current-value  return  function, 
V(x,t0),  defined  by  (17),  is  in  fact  Independent  of  tcJ  this  can  be  asen 
by  writing,  in  view  of  the  previous  remarks, 
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W(x,to)  =  (l/e‘pt°)  max  Jt  e'pt  U[x(t),v(t) ]dt 

c 

=  max  J’  e'p(t‘tc)  U[x(t),v(t)]dt. 
o 

Since  the  constraints  F(x,v)  >  0  and  the  transition  relations,  x  =  T(x,v), 

do  not  involve  time  explicitly,  it  is  clear  that  replacing  t  by  0,  say, 

leaves  completely  unaffected  the  form  of  the  optimal  policy.  This  is  an 

illustration  of  Bellman's  [1957]  veil-known  "principle  of  optimality  " 

But  if  W(x,tQ)  =  W(x),  independent  of  t,  then  from  (18)  p  i3 

completely  determined  by  the  state  x  in  the  following  sense:  Suppose 

we  have  two  optimization  problems  of  the  type  oealt  with  in  Proposition  6 

(but  also  satisfying  the  stattonarity  assumptions)  which  are  identical  in 

all  respects  except  for  initial  conditions.  Let  x‘(t)  and  x  (t)  be 

the  paths  of  the  state  variables  along  the  optimal  solutions  for  the  two 

1  ^ 

problems,  respectively,  and  let  p*(t)  and  p~(t)  be  the  corresponding 
paths  of  the  auxiliary  variables.  Then  if  x*(t)  x2t  ).  pl(t)  *  p^(t'). 

Note  that  since  p  is  determined  by  x.  and  U  and  T  do  not 
depend  on  t,  along  the  optimal  path  H(x,v,p,t)  is  a  function  of  x  and 
v  alone,  and  therefore  the  value  of  v  which  maximizes  H  depends  only 
on  x.  The  optimum  policy  can  be  represented  as  a  strategy  or  feedback 
control ,  with  v  a  function  of  x. 

Also  net*  that.,  fer  given  x,  v,  and  p,  R  is  independent  of  t, 
and  therefore  Prepositions  <*  «rd  6(c),  by  itself  implies  that  v»  is 
determined  by  x  and  p,  independent  of  t.  The  statlonarity  assump¬ 
tions  then  imply  that  t  does  not  enter  explicitly  into  the  uysteo  of  dif¬ 
ferential  equations  defined  by  (»)  and  (e)  S~rh  a  system  is  termed 
autonomous- 


if* 


Proposition  6.  Under  the  assumptions  and  in  the  notation  of 
Proposition  6,  suppose  in  addition  that 

(a)  U(x,v,t)  =  U(x,v),  T(x,v,t)  =  T(x,v),  F(x,v,t)  =  F(x.v),  and 
p( -t )  =  p,  all  independent  of  t. 

Then 

(b)  the  optimal  policy,  v*  =.  v*(x),  and  the  values  of  the 

auxiliary  variables,  p,  along  the  optimal  path,  are  functions 

of  x  alone,  independent  of  t  for  given  x; 

(c)  the  system  of  differential  equations  defined  by  (a),  (d),  and 

( e )  is  autonomous . 

For  an  autonomous  system,  considerable  interest  usually  relates  to 

its  stationary  point  or  equilibrium,  where  all  motion  ceases,  i.e.,  the 

•  « 

values  of  x  and  p  for  which  x  »  0  and  p  «  0.  ISiis  notion  in 

economics  is  that  of  long-run  stationary  equilibrium  (as  opposed  to 
temporary  or  short-run  equilibrium  in  which  capital  stocks  are  given). 

In  the  present  system  an  equilibrium  is  defined  by  x*,  p*,  v*  satisfy¬ 
ing  the  conditions: 

T(x*,v*)  •  0, 

PPf  *  l*  > 
i 

v*  maximizes  H(x*,v,p*)  under  the  constraints  F(x*,v)  >  0, 
T^xVv)  >0  if  xj*  .  0. 

If  the  Initial  state  of  the  system  is  x* ,  thee  all  the  conditions 
of  Proposition  6  can  be  satisfied  by  setting  x(t)  -  x*,  v(t)  »  v*,  p(t)  » 
p*  for  all  t.  It  may  be  asked  under  what  conditions  this  solution  la 
optimal.  More  generally,  suppose  we  can  find  a  path  satisfying  the  condi¬ 
tions  of  Proposition  1  which  converges  to  the  stationary  values;  when  is 
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this  optimal? 


For  simplicity  of  reference,  define  a  Fontryagin  path  as  a  system, 
x(t),  p(t),  v*(t),  satisfying  the  conditions  of  Proposition  6. 

Proposition  9-  Let  x(t),  p(t),  v*(t)  he  a  Fontryagin  path  for 
the  problem  of  Proposition  6.  Suppose  further  that  the  concavity  hypoth¬ 
esis  of  Proposition  7  and  the  stationarity  hypothesis  of  Proposition  8, 
with  p  >  0,  are  satisfied.  Then,  if  x(t)  and  p(t)  converge  to  an 
equilibrium,  x*,  p*,  where  p*  >  0,  they  constitute  an  optimal  path. 

Proof:  From  Proposition  7  it  suffices  to  note  that  the  transversai- 
ity  condition  a(t)  p(t)  x(t)  -+  0  is  satisfied.  3^t  p(t)  and  x(t) 
converge  to  finite  limits,  and  a(t)  -  e  ^  approaches  zero  since  p  >  0 

It  should  be  remarked,  however,  that  (a)  there  may  be  more  than  one 
equilibrium,  and  (b)  there  may  exist  optimal  paths  which  do  not  converge 
to  any  finite  equilibrium;  for  examples,  see  Kurz  [1965  and  196j,  Section  Bj. 


Lecture  3 


Optimal  Investment  Planning  in  a  One-Comaodlty  Model 

In  this  lecture  we  review  in  detail  the  simplest  possible  capital 
accumulation  model,  first  studied  by  Ramsey  [1928];  for  further  important 
contributions,  see  MLrrlees  [196?]  and  Koopaans  [1965]*  We  assume  there 
is  only  one  commodity,  which  can  be  either  consumer  or  invested.  We  take 
the  viewpoint  of  a  government  which  is  in  a  position  to  control  the  economy 
completely  and  to  plan  perfectly  so  as  to  optimize  with  respect  to  all  pos¬ 
sible  instruments  of  the  economic  system  -  in  this  case,  only  investment 
and  consumption  (which  are  subject  to  the  constraint  that  their  sum  not 
exceed  total  output). 

We  first  assume  a  constant  population  md  a  constant  labor  force. 

The  felicity  at  any  moment  is  taken  to  be  a  function  of  consumption,  C, 
only.  Bien  the  ait  of  tiie  economy  is  to  maximize 

(24)  V  «  e‘?t  U[C(t)Jdt, 

where  p  is  the  rate  of  interest  on  felicity,  C(t)  is  consumption  at 
time  t,  and  U(C)  is  the  felicity  derived  from  consumption  C.  It  is 
assumed  that 

(25)  U(C)  is  strictly  concave  and  increasing. 

The  output  at  any  moment  of  time  is  a  function  of  the  stock  of  capital 
and  of  the  labor  force.  Since  the  latter  is  assumed  constant,  we  assume 
simply 

(26)  Y(t)  .  F[K(t)J, 
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where  Y( o)  is  output  at  time  t  and  K  is  the  stock  of  capital.  With 
the  labor  force  held  constant,  increases  in  capital  may  be  supposed  to 
yield  lower  and  lower  returns;  also  it  is  assumed  that  capital  is  indispens¬ 
able  to  production. 

(27)  F  is  strictly  concave,  F(0)  =  0. 

It  is  not  necessarily  assumed  that  F(K)  is  increasing;  for  example,  if 
the  stock  of  capital  depreciates  at  a  rate  proportional  to  its  quantity, 
then  the  depreciation  ought  properly  to  be  subtracted  from  the  gross  out¬ 
put  to  get  a  true  measure  of  net  output  available  for  consumption  and  net 
investment  (increase  of  the  capital  stock).  It  is  possible  that  if  K  is 
very  large,  the  marginal  gross  output  of  an  additional  unit  may  be  less 
than  the  depreciation  on  that  unit 

Finally,  the  accumulation  of  capital  is  precisely  investment,  I, 

(28)  K  =  I, 

and  the  conservation  of  product  flow  implies  that  consumption  and  invest¬ 
ment,  together,  do  not  exceed  output,  i.e.,  C  +  I  <  Y  or,  in  view  of  (26), 

(29)  F(K)  -  C  -  I  >  0. 

It  also  follows  from  the  very  definition  of  capital  that  it  cannot 
be  negative;  K( c)  >  0,  all  t 

At  present  it  will  be  assumed  that  l  may  be  positive,  zero,  or 
negative;  the  latter  means  that  existi.ng  capital  can  be  turned  into 
consumption  goods.  The  case  where  I  is  necessarily  non-negative  will 
be  considered  later.  It  will  be  assumed  that  C  >  0;  but  to  simplify 
matters  it  will  be  also  assumed  for  the  moment  that 

(30)  U’(0)  *  +  •, 


P.0 


which,  as  will  be  seen,  implies  that  the  choice  of  the  instrument  C  at 
any  moment  of  time  will  necessarily  be  positive,  so  that  the  non-negativity 
constraint  is  ineffective. 

Propositions  6-9  can  be  applied  to  this  model.  Hie  state  of  the 
system  is  represented  by  the  single  variable,  K.  There  are  two  instru¬ 
ments,  C  and  I.  The  felicity  function  depends  only  on  the  one  instru¬ 
ment  C;  the  transition  function  (28)  depends  only  on  I.  The  choice  of 
C  and  I  is  constrained  by  (29),  which  corresponds  to  Proposition  6(b) 
or  4(b).  There  will  be  one  auxiliary  variable,  p,  so  that  the  current- 
value  Hamiltonian  is 

H  =  U(C)  +  pi, 
and  the  Lagrangian,  L,  is 

(31)  L  =  lf(c)  +  pi  +  q[F(K)  -  C  -  I] 

=  [U(C)  -  qC]  +  (p-q)l  +  q?(K)  . 

3y  Proposition  6(g)  or  4(g),  C  and  I  must  be  chosen  so  that 
$L/dc  a  0  and  dL/dl  a  0.  The  latter  implies  that 

(32)  p  =  q- 

The  former  implies  that  U'(C)  «  q  and,  by  (32), 

(33)  U'(C)  -  p. 

Because  of  (30)  and  the  concavity  of  U(C),  it  is  assured  that  the  solu¬ 
tion  to  (33)  will  be  positive. 

The  auxiliary  equation,  Proposition  6(e),  becomes 
* 

P  -  pp  -  q(&/&), 

(34)  P  -  [p  -  F'(K)]p, 

in  view  of  (31)  and  (32).  Since  U'  >  0  by  (25),  p  >  0  by  (33)#  and 
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q  >  0  by  (32);  the  constraint  (29)  is  effective,  so  that  from  (28) 

(35)  K  -  F(K)  -  C(p), 
where 

(36)  C(p)  is  the  solution  of  (33). 

Equations  (34)  and.  (35)  are  a  pair  of  autonomous  equations.  An 

■  •  * 

equilibrium  is  defined  by  p  *  0  and  K  *  0.  But  p  -  0  means 
p.O  or  F'(K)  =  p. 

3ince  U'  >  0,  the  first  alternative  is  impossible  from  (33)  Let  K*, 
p"  be  the  equilibrium  values  of  K  and  p,  respectively,  and  let  C™ 
and  I*  be  the  values  of  the  instruments  at  the  equilibrium.  It  has  Just 
been  shown  that 

(37)  F'(K*)  «  p. 

8ince  F  is  strictly  concave,  F'  is  strictly  decreasing.  It  will  be 
assumed  that  (37)  has  a  solution  with  K*  >  0.  Ibis  is  equivalent  to 
assuming  that  F'(0)  >  p,  F'(K)  <  p  for  K  sufficiently  large. 

From  (28)  and  the  definition  of  equilibrium, 

(38)  r  -  0. 

« 

From  (35)  and  (36),  with  K  -  0, 

(39)  C*.  F(K"), 

(40)  P-.U-(C-) 

Consider  now  all  solutions  of  the  differential  equations  (34)  and 
(35)>  Their  movements  may  be  represented  in  a  phase  diagram  (Figure  1). 
Since  F'(K)  is  decreasing,  p  -  F‘(K)  Is  increasing;  from  (34),  then, 

9  s 

p  >  0  if  K  >  K*,  p  <  0  if  K  <  K*\  Since  IT  is  a  decreasing  function, 
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-t  follows  from  (36)  and  (33)  that  C(p)  is  a  decreasing  function  of  p. 
Hie  curve  for  which  K  «  0  is.,  from  (35),  defined  by  the  equation 

F(K)  =»  C(p). 

This  equation  can  be  solved  uniquely  for  p  in  terms  of  K;  call  the 
solution  «(K).  Since  F(K)  is  concave,  and  F'CK00)  a.  p  >  0,  F(K) 
is  either  always  increasing  or  increasing  up  to  a  value  K  >  K;  there¬ 
fore,  «(k)  is  decreasing  for  all  K  or  else  decreasing  to  K  and 

* 

increasing  thereafter.  Further,  for  fixed  K,  K  F(K)  -  C(p)  is  an 

#  • 

increasing  function  of  p  so  that  K  >  0  above  the  curve  8nd  K  <  0 
below.  Hie  components  of  the  directions  of  movement  in  the  four  quad¬ 
rants  into  which  the  diagram  is  divided  by  the  curve  K  ...  0  and  the 

vertical  line  p  *  0  are  indicated  In  Figure  !  by  arrows;  note  that 
•  »  ♦ 

K  >  0  is  a  movement  tc  the  right,  and  p  >  0  an  upward  movement. 

Regions  XI  and  IV  are  traps  in  the  sense  that  a  Fontryagin  path 

which  enters  either  of  these  regions  never  leaves  it  Further.,  any  path 

which  comes  to  a  boundary  of  either  region  must  enter  that  region  and 

then  remain  in  it  permanently.  It  will  now  be  shown  that  a  path  which 

enters  either  region  cannot  ^e  optimal. 

Consider  first  region  IV.  Without  loss  of  generality,  suppose  that 

the  path  is  in  region  IV  at  time  0  Then  p  >  0,  K  >  0,  all  t  Since 

K(0)  >  K°°,  K(t)  >  K(0)  >  K°°,  all  t.  Since  F’  is  decreasing, 

F'[K(t)]  <  F'[K(0)3  <  F  !(KJ°)  ■-  p,  sc  that,  from  (3a),  p/p  *  p  -  F’[K(t)J 

>  €  «  p  -  F’[K(0)]  >  0;  p(t)  >  p(0)  ect  bv  integration,  so  that., 

certainly, 

p(t)  >  p"  for  all  t  >  t  . 

for  some  tQ.  Since  Cti  l-  1  d-c  ng  function  f. . 


C(t)  <  c(p“)  *  C*  for  t  >  t  . 

Since  X(tQ)  >  X**,  It  follows  that  ve  can  always  improve  on  the  given 
path  by  consuming  the  capital  stock  (diainvestlng)  in  tome  interval  begin¬ 
ning  at  tQ  until  K  diminishes  to  X*,  after  which  the  equilibrium 
policy,  C*e"»  P(K*),  K  «  X*,  is  maintained  perpetually. 

Row  consider  any  trajectory  in  region  II.  By  the  same  reasoning 

a 

p/p  <  c,  where  now  e  •  p  -  F'[K(0)]  <  0.  But  then  p(t)  -*  0,  which 
implies  C[p(t)]  -*  +  ».  Since  7(x)  is  urdfomly  bounded  on  the  closed 

4 

interval  <0,  X°°  >,  K  *  7[X(t)]  -  C[p(t)]  <  t  <  0  from  scam  tiste-  on . 

££ 

ltoen  K(t)  must  become  zero  at  some  finite  time.  Since  C  >  0,  then, 

I  ■  f(0)  -  C  <  0,  and  K  will  become  negative,  violating  the  non¬ 
negativity  of  K. 

Consider  now  a  path  starting  in  quadrant  I.  If  it  stays  in  quadrant 
I  forever,  then  both  p  and  X  are  bounded  from  below.  Since  both  are 
decreasing,  they  approach  limits  which,  by  a  general  theorem  on  differential 
equations,  can  only  be  the  equilibrium  values.  By  Proposition  9  such  a 
path  la  necessarily  optimal.  If  the  path  did  not  remain  in  quadrant  I 
for  all  t,  then  it  reaches  either  the  boundary  with  quadrant  IV  or  that 
with  quadrant  II;  then,  as  already  noted,  the  path  cannot  be  optimal. 

Similarly,  any  path  in  quadrant  III  which  remains  there  forever 
approaches  the  equilibrium  and  is  optimal;  any  other  path  la  non-optimal . 

It  only  remains  to  argue  that,  for  any  Initial  K  •  X(0),  there  la  a 
corresponding  p(0),  with  the  point  (X,  p(0))  in  quadmnt  I  or  quadrant 
III  according  aa  X  <  X*  or  X  >  X*,  suoh  that  the  Fontryagln  path  start¬ 
ing  at  that  point  approaches  the  equilibrium.  Such  a  path  la  certainly 
optimal. 
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The  approach  makes  use  of  the  fact  that,  under  the  stationaxity  «•> 

■»  a 

assumptions  of  thia  problem,  p  and  the  optimal  Instruments  C  and  I 
are  functions  of  X.  Divide  (34)  by  (3?)  to  see  that 

(41)  dp/dK  -  p[p-r(K))/(F(K)-  C(p)J. 

Consider,  for  values  of  X  <  K*,  solutions  of  (4l)  which  Intersect 
the  line  K  «  K*  above  the  equilibrium,  l.e.,  for  p(X**)  a  prescribed 
value  greater  than  p".  Such  a  solution  can  be  continued  for  smaller  and 
smaller  values  of  K.  We  first  note  that  It  can  never  cross  the  curve 

e 

K  *  0,  which  has  been  written  p  *  x(K).  Let  p(K)  be  the  solution 
of  (41),  and  suppose  it  intersected  the  curve  p  =  x(X)  at  K  <  X*. 

Then  p’(K)  -♦  -  •  as  K  ~*K  >  0.  But  p(K)  >  x(X)  for  K  In  e  right- 
band  neighborhood  of  K,  and  therefore  p'(X)  >  x’(K),  a  contradiction 
alnce  *’(X)  la  certainly  finite. 

Since  the  denominator  of  (41)  is  finite  end  the  numerator  la  hounded 
from  above,  it  la  clear  that  the  solution  of  (41)  can  be  continued  for  all 
positive  values  of  X  <  K*\  There  is  one  such  solution  for  each  value  of 
p(X**)  >  P**-  These  solutions  never  cross  because  of  the  uniqueness  of 
solutions  of  this  differential  equation  away  from  the  equilibrium  point. 

Banes,  for  any  given  K  <  X*,  there  la  a  lower  bound,  £(K)»  on  the 
values  of  p  for  whloh  there  exists  a  solution  of  (41)  passing  through 
(K,p)  and  for  whloh  p(x")  >  P*.  It  la  obvious  and  can  easily  be 
demonstrated  that  g(K)  also  satisfies  (41),  and  that  £(x")  •  p".  This 
path  In  (p,K)- space  defines  the  optimal  trajectory.  If  X(0)  <  X**, 
choose  p(0)  -  £(K(0)).  Then  the  points  of  the  time  solution,  p(t), 

K(  c),  for  (34)  and  (33)  move  along  the  trajectory  £(X)  and  converge 
to  the  equilibrium 
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Ihe  solution  in  this  form  is  very  convenient,  for  the  choice  of  the 
instruments,  C  and  1,  is  determined  as  a  function  of  K  by  (33)  and 
(29)  (with  equality). 

Ihe  analysis  in  quadrant  III  is  the  same,  except  that  ve  find  for 
each  K  >  K*  the  upper  bound  of  p-values  for  which  the  solution  of  (bl) 
passes  below  the  equilibrium. 

It  should,  however,  be  noted  that  we  could  apply  the  same  proct  'Hire 
in  quadrants  II  and  IV;  but  then  the  limiting  solution  would  be  the 
divergent  dashed  curves  marked  in  figure  1. 

Ihe  optimal  solution,  then,  is  defined  by  a  solution  of  (bl)  which 
passes  through  the  equilibrium;  but  there  are  two  such  solutions.  The 
equilibrium  is  a  singular  point  of  (bl),  so  the  solution  through  that 
point  need  not  be  unique.  In  this  case  it  is  clear  that  the  optimal 
solution  is  identified  as  the  one  with  the  negative  slope  at  the  equilib¬ 
rium. 

We  will  analyze  the  non-uniqueness  at  equilibrium  a  little  more 
closely.  The  right-hand  side  of  the  differential  equation  (bl)  is, 
strictly  speaking,  not  defined  at  K  •  K*,  p  •  p",  since  both  numerator 
and  denominator  vanish.  Since  p  is  to  be  a  function  of  K,  both 
numerator  and  denominator  are  functions  of  X,  directly  and  through  p. 
Z*t 

(*2)  000  •  P(P-r(K)],  HK)  -  f(K)  -  C(p), 

(*3)  P*(K)  •  0(X)/f<X)  . 

Since  both  0  and  f  vanish  at  X  *  X*,  we  can  define  p*(X)(«dp/dR} 
there  by  L'8opitai‘s  rule: 

(bb)  P*(X*)  -  «x")/f(x*), 


a? 


and  it  remains  to  evaluate  these  derivatives. 


also  asaune  K(0)  given.  Define  X*,  C°°,  p°°  by  the  relations  P'(K*)  * 
p,  C«  »  P(K"),  p*  *  U'(C*). 

Then  the  optimal  strategy  can  be  characterized  by  finding  that  solu¬ 
tion  p(K)  of  the  differential  equation  ( 41 )  for  which  pfK*)  ■  p“  aud 
for  which  p’(K°°)  is  the  negative  root  of  the  quadratic  equation 
(-1/U"(C“)J  [p'(K“)J2  ♦  P  p'(k")  ♦  p"  P”(K*)  -  0.  Then,  for  any  K, 

C  is  so  chosen  that  ’J'(C)  ■  p(K),  and  I  ■  F(k)  -  C. 

Proposition  10  has  been  stated  without  the  hypothesis,  U'(0)  «  +  », 
which  was  used  in  the  proof.  It  will  be  an  interesting  exercise  in  the 
use  of  Proposition  6  to  consider  the  case  where  U'(0)  is  finite.  In 
this  case  the  constraints  C  >  0  and  h  >  0  may  Become  effective. 

x  at 

Consider  the  first  for  regions  in  which  X  >  0.  Let  w  be  the  multiplier 
associated  with  the  constraint,  C  >  0.  Then  { 31 )  is  modified  to  read 

(U6)  L  -  U(C)  ♦  pi  ♦  q(P(K)  -  C  -  Ij  ♦  wC 

-  [u(c)  -  (q-v)  Cl  ♦  (p-q)  I  ♦  qp(x), 

where 

(47)  w  >  0,  wC  -  0. 

If  C  »  0,  then  the  condition  dl/dc  ■  0  becomes 
U'(0)  .  U*(C)  -  q  -  v  <  q. 

Vie  still  have  the  condition  p  •  q,  so  that  C  «  0  if  p  >  U'(0).  The 
system  (34)  and  (35)  is  still  valid,  but  the  definition  of  C(p)  is 
slightly  modified; 

C(p)  is  the  solution  of  the  equation,  U*(C)  •  p 
if  p<U*(0), 

C(p)  -  0  if  p  >  U'(0). 
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Uie  previous  analysis  is  completely  unchanged;  in  Figure  1,  the 
curve  K  -  0  intersects  the  p-axis  at  p  =  U'(0)  instead  of  being 
asymptotic  to  it.  Since  the  optimal  trajectory  lies  above  the  curve 
K  =  0,  ther°  will  be  a  k  >  0  for  which  p(k)  =  U'(0).  For  k  <  k , 

C  will  then  be  zero. 

How  consider  the  possibility  that  the  constraint  K  =  0  becomes 
effective.  s  has  already  been  seen,  this  question  arises  only  for  paths 
which  start  in  or  have  entered  region  II.  It  will  be  shown, even  with 
U'(0)  finite,  suen  paths  are  non-optimal. 

Recall  the  basic  definition  (l8)  of  p  as  dW/dK,  where  W  is 
maximum  value  of  the  utility  functional  if  the  initial  state  is  K. 
Clearly,  in  this  model  an  increase  in  K  is  always  beneficial;  given 
an  increase  in  K,  one  can  always  consume  a  somewhat  higher  amount  for 
some  period  until  the  value  of  K(t)  falls  to  that  on  the  original  path, 
and  then  follow  the  latter  thereafter.  Hence,  p  must  be  positive. 

No'*  consider  a  path  that  has  reached  the  p-axis  at  time  tQ  from 
region  II.  Since  the  initial  value  of  p  was  finite,  the  time  to  reach 
the  p-axis  was  finite,  and  the  right-hand  side  of  (34)  is  bounded  over 
this  path,  p(tQ)  is  finite.  Now  the  constraint  K  >  0  becomes  ef¬ 
fective  and  therefore  the  constraint  I  >  0  is  imposed.  The  constraints, 

C  >  0,  I  >  0,  C  +  I  <  F(K)  «  F(0)  =  0, 

imply  that  C  and  I  are  0;  from  the  latter,  it  follows  that  K(t)  =  0 
for  all  t  >  t  .  The  Lagrangian  (46)  is  modified  by  the  addition  of  a 
term  corresponding  to  the  constraint  I  >  0,  with  multiplier  r. 

T,  =  U(C)  +  pi  +  q[F(K)  -  C  -  I]  +  wC  +  rl 
*  [U(c)  -  (q-w)  C]  +  (p+r  -  q)  I  +  qF(K), 


where  r  >  0  when  1*0.  No  longer  does  the  equality  p  =  q  hold; 
Instead, 

p  =  q  -  r  <  q. 

Equation  (3^)  is  modified  to 

p  *  p  ?  -  q?'(K)  *  p  p  -  qF'(O)  for  t  >  tQ. 

Recall  that  ?*(0)  >  p.  Use  in  turn  the  inequalities  p  <  q,  U'(0) 

<  q. 

P  =  P  P  -  qF’(O)  <  q[p  -  F'(0) 1  <  U’(0)  [o  -  F'(0)]. 

The  last  term  is  a  negative  constant.  Hence,  y  must  become  negative  in 
finite  term,  which  is  a  contradiction  to  the  assumption  of  the  optimality 
of  the  path  being  studied. 

It  may  be  asked  what  happens  if  the  initial  stock  of  capital  is  0. 
Ifce  only  feasible  path  is  that  of  zero  investment  and  consumption.  Ihe 
argument  just  given  would  show  that  for  any  finite  p(0),  p(t)  would 
become  negative  eventually.  The  answer  evidently  is  that  p(0)  must  be 
chosen  +  «  initially,  and  then  p(t)  would  remain  +  ». 
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Lecture  4 


Further  Aspects  of  the  Ramsey  Problem:  Irreversibility; 


Growth  of  Population  and  Labor  Force  and  Technical  Change 


It  is  sometimes  reasonable  to  argue  that  investments,  once  made  in 
physical  form,  cannot  be  converted  into  consumer  goods.  Hence,  invest¬ 
ment  should  be  irreversible,  i.e.,  subject  to  the  constraint  I  >  0.  On 
the  other  hand,  there  is  in  real  life  a  somewhat  more  subtle  way  in  which 
capital  can,  within  limits,  be  run  down  and  permit  more  consumption; 
namely,  capital  goods  depreciate  and  failure  to  replace  them  constitutes 
a  way  of  increasing  consumption  at  the  expense  of  capital. 

A  reasonable  assumption  about  depreciation  is  that  a  fixed  fraction 
of  the  existing  capital  becomes  useless  in  each  time  period.  Thus,  the 
net  rate  of  increase  of  capital  is  the  amount  of  (gross)  investment,  i.e. 
new  output  devoted  to  capital  uses,  less  the  amount  of  depreciation.  Hii 
amounts  to  replacing  (28)  by 

(48)  K  *  I  ■  6  K, 

for  some  5  >  0.  We  also  assume  that  investment  is  non-negative, 

(49)  I  >  0. 

Otherwise  the  model  is  identical  with  that  of  the  last  lecture, 
including  (24)  and  (29),  with  the  assumptions  (25),  (27),  and  (30) 

(though  the  last  is  dispensable).  Let  p  be  the  auxiliary  variable 
corresponding  to  (48),  q  the  multiplier  corresponding  to  (29),  and  s 
that  corresponding  to  (49).  The  Lagrangian  becomes 


(50) 


L  =  U(C)  +  p(l  -  6  K)  +  q[F(K)  -  C  -  I ]  +  si 
--  [U(C)  -  qC]  +  (p+s-q)  I  ♦  qF(K)  -  p  6  K, 


where 


(51)  s  >  0,  si  *  0. 

Equating  derivatives  with  respect  to  C  and  K  to  zero  yields 

(52)  U’(C)  =  q, 
q  =  P  +  s, 

which  can  be  combined  with  (51)  to  yield 

(53)  p  <  qj  if  p  <  q,  then  1=0. 

The  auxiliary  equation  is 

(54)  P  =  (p  +  B)  p  -  qF’(K)  • 

Since  the  problem  is  stationary,  we  know  that  the  instruments  and 
the  auxiliary  variable  are  functions  of  K.  Prom  (53),  for  any  given  K, 

there  are  two  possibilities:  either  p  *  q,  or  p  <  q  with  1=0. 

Therefore,  the  K-axis  is  divided,  in  general,  into  alternating  blocked 
and  free  intervals; 

(55)  I  ■  0,  C  =  F(K)j  q  *  U'tP(K))  >  p  on  a  blocked  interval; 

(56)  I  >  0,  C  <  F(K)j  q  =  p  >  U'(F(K)]  on  a  free  interval. 

From  (48),  K  ■  -  B  K  <  0  if  1=0.  Hence,  the  system  cannot  have 
an  equilibrium  in  a  blocked  interval.  Then  p  ■  q  at  an  equilibrium 
and,  from  (54),  (48),  and  (29),  the  following  relations  hold  at  equilibrium: 

.  (57)  F'(K-)  -  B  -  p, 

I*  -  6  K*, 

C*  -  F(K")  -  6  K*, 

p*  ■  U'(c*). 
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It  will  be  observed  that  the  equilibrium  is  the  same  as  that  for 
the  reversible  Ramsey  problem,  where  F(K)  is  replaced  by  F(K)  -  5  K. 
Indeed,  if  we  define 

IH  -  I  -  6  K, 

then,  if  the  constraint  (1*9)  is  Ineffective,  the  problem  is  identical  with 
the  reversible  Ramsey  prt Jlem,  with  I  replaced  by  IR.  For  the  revers¬ 
ible  Ramsey  problem  the  .'o.i*ni  policy  would  have  I„  >  0  for  K  <  K°*, 

IV  *  * 

and  therefore  I  *  IR  ♦  ft  .  0.  Hence,  starting  with  any  such  K,  if 

the  optimal  policy  for  the  reversible  Ramsey  problem  is  followed,  it  will 
always  satisfy  constraint  (49)  and  therefore  remain  feasible  under  ir¬ 
reversibility.  It  will  therefore  remain  optimal.  Indeed,  the  same  must 
be  true  for  K  in  sane  right-hand  neighborhood  of  K*  ,  for  while  IR  -  0, 
I_  6  K  >  0  for  K  m  K*  and,  by  continuity,  I  >  0  for  K*  <  K  <  K 

IV  m  a  m 

for  some  K  (which  might  even  be  ♦  •) .  On  the  optimal  path  K  decreases 
in  this  interval,  and  therefore  K  never  goes  outside  the  Interval,  so 
that  the  optimal  path  for  the  reversible  Ramsey  path  is  still  feasible 
and  therefore  optimal  for  K  <  K. 

The  general  method  for  finding  the  optimal  strategy  can  now  be 
sketched.  As  before,  we  are  interested  in  the  differential  equation 
defining  p'(K)  •  dp/dK.  From  (48)  and  (54), 

(58)  P’OO  -  t(o  ♦  5)  P  -  qF’(K))/  (1-6  K). 

Here  q  and  I  can  be  determined  as  functions  of  K  and  p  from  (55) 
and  (56).  In  the  neighborhood  of  the  equilibrium,  the  solution,  as  noted, 
is  the  same  as  for  the  reversible  case.  From  (55)  and  (56),  (58)  special¬ 
ises  in  the  two  kinds  of  intervals  as  follows: 
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(59)  p'(K)  -  PIP  +  6  -  F^KjJ/fFCK)  -  6  K  -  c(p) ]  in  a  free  interval 

(60)  P'(K)  *  [(e  +  6)  p  -  U'[F(K)]  F’(K)]/(-6  K)  in  a  blocked 
interval. 

Let 

(61)  r  *  U’[F(K)]/p  • 

From  (55)  and  (56), 

(62)  r  <  1  on  a  free  interval,  r  >  1  on  a  blocked  interval. 

We  know  there  is  a  free  interval,  <0,K>,  with  K  >  K*.  Wc  there¬ 
fore  solve  (59)  around  the  equilibrium  and  continue  it  first  for  all 
smaller  values  of  K.  Then  continue  it  for  larger  values  of  K  until 
r  reaches  1  The  K-vaiu**  where  this  occurs  is  K,  and  there  is  a 
calculated  value  of  p,  p(K).  We  then  solve  (60)  with  this  starting 
point  until  r  comes  down  to  I  from  above.  At  this  point  we  start  a  new 
free  interval,  and  solve  (59),  but  with  the  starting  point  being  that 
achieved  at  the  end  of  the  previous  blocked  interval.  This  process  can 
be  continued  indefinitely. 

Thus,  the  problem  is  capable  of  numerically  meaningful  solution. 
Analytically  sharper  characterization  cannot  be  obtained  in  general, 
though  more  specific  hypotheses  imply  acme  limits  on  the  numbers  of 
blocked  and  free  intervals.  In  particular,  though,  it  can  be  shown  that 
It  is  possible  to  have  a  denumerable  or  arbitrarily  large  finite  number 
of  alternations  between  free  and  blocked  Intervals-  For  these  and  other 
results,  see  Arrow  and  Kurz  (19671- 

A»*':oer  modification  of  tbe  Ramsey  model  consists  of  allowing  for 
growth  in  population  and  labor  force  and  for  technological  change. 

Under  certain  simple  but  by  no  means  absurd  assumptions,  these  factors 
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can  be  Introduced  into  the  Ramsey  model  by  a  simple  reinterpretation  of 
variables. 


V 


Population  by  itself  affects  the  utility  functional.  Let  N(t)  be 
the  number  of  Individuals  at  time  t.  Assume  for  simplicity  that  the 
aggregate  amount  of  consumption  at  any  time  t  is  divided  equally  among 
the  existing  population.  Assume  also  that  each  individual  has  the  same 
felicity  function.  Then  the  felicity  of  any  individual  at  time  t  is 
U[C(t)/H(t)].  Since  there  are  N(t)  individuals,  it  is  reasonable  to 
conclude  that  the  total  felicity  of  society  at  time  t  is 
ll(t)  U[C(t)/N(t)],  and  the  utility  functional  then  is 

(63)  J^*  e"0tN(t)  U(C(t)/N(t)]dt. 

The  production  posalbilities  of  society  are  of  course  influenced  by 
the  size  of  the  labor  force.  This  effect  has  been  Ignored  until  now 
because  the  labor  force  has  been  assumed  constant.  The  growth  of  the 
labor  force  is  roughly  proportional  to  that  of  population,  but  it  will 
be  convenient  to  Ignore  this  relation  for  the  moment.  Ue  assume  in  any 
case  that  the  size  of  the  labor  force  is  a  known  function  of  time, 
Independent  of  the  instruments  or  the  state  variables-  Let  L(t)  be  the 
number  of  workers  at  time  t.  For  any  given  supplies  of  capital  and  labor, 
output  is  determined  by  the  production  function 

(6k)  Y  -  F(K,L), 

where  it  is  assumed  that 

(65)  F  is  concave  and  homogeneous  of  degree  1,  and  F(0,L)  ■  0. 

The  property  of  homogeneity  of  degree  1  is  known  to  economists  as 
constant  returns  to  scale;  if  labor  and  capital  are  varied  in  the  same 
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proportion,  then  the  same  productive  methods  can  be  employed,  with  only 


theor  scale  changed,  and  therefore  output  can  be  changed  in  the  same 
proportion.  HSiis  assumption  is  not  fully  true  but  may  be  accepted  as  an 
approximation.  The  assumption  F(0,L)  =  0  amounts  to  saying  that  capital 
is  indispensable  in  production. 

The  transition  equation  of  the  system  is  still 
« 

(48)  K  =  I  -  6  K. 

The  constraint  on  the  instruments,  C  and  I,  is  now 

(66)  F(K,L)  >  C  +  I. 

Technological  progress  can  be  stated  formally  as 

(67)  Y  *  F(K,L,t), 

that  is,  the  output  obtainable  from  fixed  amounts  of  capital  and  labor 
varies  with  time,  presumably  increasing.  A  particular  hypothesis  about 
technological  progress  for  which  there  is  seme  evidence  is  that  it  is 
labor-augmenting,  which  has  the  more  Specific  form: 

Y  «  F[K,  A(t)  L], 

that  is,  each  worker  at  time  t  can  do,  in  every  way,  exactly  what  A(t) 
workers  could  do  at  time  0.  In  this  form,  however,  we  can  see  that  we 
may  as  well  retain  (64)  where,  however,  it  is  understood  that  L  now 
represents  not  the  number  of  workers  in  the  usual  sense  but  the  number 
of  efficiency-equivalent  workers.  Thus,  in  the  new  definition,  L  can 
and  usually  will  be  increasing  more  rapidly  than  H. 

The  Lagranglan  is 

(60)  H  -  N(t)  U(C(t)/N(t)]  +  p(l  -  5  K)  ♦  q[F(K,L)  -  C  -  I). 
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The  necessary  conditions,  with  I  unrestricted  as  to  sign,  become 

(69)  p  -  q,  U'[C(t)/N(t)]  .  p  , 

(70)  p  -  pCp  ♦  &-{*/&)]• 

The  system  looks  much  as  it  did  before,  but  it  is  not  autonomous 
since  time  enters  explicitly  through  N(t)  and  through  L(t)  in  &F/&K. 
It  is  possible  to  use  a  non-autonomous  system,  but  autonomous  systems  are 
much  more  convenient;  with  appropriate  changes  of  variables,  together  with 
additional  assumptions,  it  is  possible  to  state  the  system  in  autonomous 
form. 

Since  labor  is  growing  we  can  hardly  expect  an  equilibrium  in  terms 
of  the  original  variables,  but  it  is  reasonable  to  suppose  there  will  be 
one  in  terms  of  ratios  to  the  labor  force.  Divide  all  variables  by  L, 
and  let  small  letters  denote  the  resulting  intensive  magnitudes: 

(71)  o  -  C/L,  k  -  K/L,  i  -  I/L. 

Let 

(72)  f(k)  -  F(k,l). 

Then,  from  (65), 

(73)  f(k)  is  concave,  f(0)  ■  0, 

(74)  F(K,L)  «  LF(K/L,l)m  Lf(K/L), 

so  that  f(k)  expresses  the  output  per  (effective)  worker  as  a  function 
of  the  capital  per  worker.  Differentiate  (74)  partially  with  respect  to 
K. 

(75)  rK  .  Ki/L)  f *(k/l)  .  f’(K/L). 


(70)  can  then  be  written 


(76)  p/p  «  P  +  6  -  f'(k). 

Since  log  k  ■  log  K  -  log  L  (natural  logarithms), 

•  «  • 
k/k  «  (K/K)  -  {L/L)  • 

Multiply  through  by  k,  note  that  k/K  «  l/L,  substitute  from  (48), 
and  use  the  definitions  (71). 

(77)  k  *  i  -  (7  ♦  5)k, 
where 

(78)  7  -  L/L, 

frequently  referred  to  as  the  natural  rate  of  growth  of  the  economy 
(remember  that  L  has  b?en  so  defined  as  to  reflect  technical  progress 
as  well  as  labor  force  growth).  Divide  through  in  (66)  by  L,  and  use 
the  definition  (71)  and  (72). 

(79)  f(k)  >  c  ♦  i. 

The  equality  will  certainly  always  hold  in  (79)-  Elimination  of  i 
between  (77)  and  (79)  yields 

(80)  k  -  f(k)  -  (7  ♦  6)k-c. 

Now  define 

(81)  g(k)  .  f(k)  -  (7  ♦  6)k; 

from  (73)#  g(k)  is  concave,  g(0)  •  0.  Then  (76)  and  (00)  can  be 
written: 

(82)  p/p  •  P  •  7  -  g*(k), 

(83)  k  •  g(k)  -  c. 
finally,  (69)  can  be  written 

(84)  U’t(iyn)c).  p. 
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The  system  (82-64)  would  be  autonomous  if  the  following  two  condi¬ 
tions  are  satisfied: 

(85)  7  constant, 

(86)  L(t)/lf(t)  constant. 

This  is  the  case  of  no  technological  progress  and  a  constant  rate  of 
population  and  labor  force  growth,  lfcen  the  equations  have  exactly  the 
same  form  as  those  for  the  Ramsey  model,  with  K,  C,  F(K),  and  p  replaced 
by  k,  c,  g(k),  and  p  -  7,  respectively.  Tbe  importance  of  the  last 
substitution  must  be  stressed.  The  optimality  analysis  of  the  Ramsey 
case  made  use  of  the  hypothesis,  p  >  0  to  show  that  the  transversality 
conditions  were  satisfied.  This  condition  seems  reasonable.  But  in  the 
case  of  growth,  the  corresponding  condition  is  p  >  7;  this  is  somewhat 
odd  because  the  value  of  p  is  a  value  Judgment  while  that  of  7  is  an 
empirical  fact.  There  seems  no  Intrinsic  reason  why  the  Inequality  should 
hold  in  one  direction  or  the  other. 

It  must  be  remarked,  moreover,  that  the  hypothesis  cannot  be  es¬ 
sentially  weakened.  In  the  Ramsey  model  without  growth  it  can  be  shown 
that  if  p  <  0,  there  is  no  optimal  path  in  any  meaningful  sense;  for  a 
detailed  analysis  see  Xoopmans  (198?,  p-  251-2  and  279-95 );  as  Just  seen, 
the  same  result  holds  if  the  economy  is  growing  at  a  cor. s tan >  rate  7  and 
p  <  7.  The  borderline  case,  p  «  0  in  the  model  without  growth  or  p  ■  7 
in  a  growing  economy,  has  been  studied  in  considerable  detail  by  Hussey 
(1926),  Koopmane  (1965,  PP-  239-43  and  269-75),  von  WeUsfcker  (1965]» 
Alternative  definitions  of  optimality  are  possible  since  the  utility 
functional  need  not  converge,  and  in  general  the  existence  of  an  optimal 
program  in  the  borderline  case  depends  on  the  specific  properties  of  the 


production  function. 

lb  allow  for  technological  progress,  we  wish  to  relax  (86)  and  allow 
the  ratio  L(t)/N(u)  to  be  increasing.  We  still  wish  to  arrive  at  an 
autonomous  system.  In  the  system  (82-84)  it  is  (84)  which  will  no  longer 
be  autonomous.  In  general  there  is  no  transformation  of  the  variables  in 
(82-84)  which  will  make  the  system  autonomous,  but  such  a  transformation 
is  possible  if  U'  is  homogeneous  of  some  degree.  Note  that  U'  must 
be  decreasing;  therefore  it  must  be  homogeneous  of  some  negative  degree, 
say  -ff. 

(8?)  Assume  that  U'(c)  is  homogeneous  of  degree  -a,  a  >  0. 

We  also  assume,  to  replace  (86), 

(88)  L(t)/N(t)  has  a  constant  rate  of  growth,  t, 

which  may  be  interpreted  as  the  rate  of  (labor-augmenting)  technological 
progress. 

From  (87)  and  (84), 

i«t)/.Kt)r<’  u  (.) .  P, 

or 

U‘(c)  .  PlL(t)/N(t))a. 

In  an  effort  to  reach  an  autonomous  system  it  is  then  a  good  idea  to 
define 

:s*>  j.  piKo/moi". 

so  that 

(90)  U'(c)  .  ?, 

and  then  seek  a  differential  equation  for  p  to  replace  (82),  Take  the 
logarithm  of  both  sides  in  (89).  differentiate  with  respect  to  time  and 


substitute  from  (82)  and  (88). 

(91)  p/p  =  P  +  (jt-7  -  g'(k). 

*he  system  of  equations  (83),  (91),  and  (90)  is  now  again  of  the  same 
form  as  the  Ramsey  model,  with  K,  C,  p,  F(K),  and  p  being  replaced  by 
c,  p,  g(k),  and  (p  +  a  x)  -  7,  respectively.  Ihe  last  conditions 
mean  that  for  optimality  we  need 

(92)  p  +  a  X  >  7, 

with  some  possible  cases  of  optimality  when  equality  holds.  It  is  also 
worth  noting  that,  from  (91),  the  equilibrium  capital-labor  ratio,  k°°, 
is  defined  by 

g'(0  =  p  +  a  x  -  7. 

From  (8l),  this  can  be  written 

(93)  f ' (k”)  -  6  =  p  +  a  x. 

The  left-hand  side  Is  thus  the  equilibrium  net  marginal  productivity  of 
capital  (net  of  depreciation,  that  is)  and  so,  in  usual  economic  terminology, 
the  right-hand  side  is  an  equilibrium  rate  of  interest. 

Remark  1.  The  existence  condition  (92)  amounts  to  saying  that  the 
equilibrium  rate  of  interest  exceeds  the  rate  of  growth. 

Remark  2.  ''’he  equilibrium  rate  of  interest  is  higher,  the  higher 

the  rate  of  technological  progress.  Notice  also  that  if  x  =  0,  then  the 
entire  equilibrium  does  not  depend  in  any  way  on  the  felicity  function 
but  only  on  the  production  function  and  the  utility  rate  of  discount,  p. 

With  technological  progress,  on  the  other  hand,  this  ceases  to  be  true} 
other  things  being  equal,  the  marginal  productivity  of  capital  is  higher 
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(and  therefore  the  capital-labor  ratio,  k*,  is  smaller)  the  higher  a, 
i.e.,  the  more  rapidly  the  individual  becomes  surfeited  with  goods. 

Remark  3-  Note  also  that  c  is  consumption  per  effective  worker, 
not  consumption  per  capita.  As  the  optimal  path  converges,  c  converges 
to  a  limit;  but  since  L/N  increases  at  the  constant  rate  t,  it  follows 
that  asymptotically  consumption  per  capita  will  grow  exponentially  at  the 
rate  t. 
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Lecture  5 


Optimal  Growth  in  a  Dual  Economy 

It  is  a  common  hypothesis  among  economists  that  in  underdeveloped 
countries  there  exist  side-by-side  two  economic  systems,  one  advanced 
and  the  other  backward.  The  economic  significance  of  this  separation 
is  that  workers  in  the  advanced  economy  receive  a  wage  which  may  be 
much  higher  than  anything  received  in  the  backward  sector.  At  the 
same  time,  it  is  assumed  that  these  workers  save  nothing,  so  that  any 
capital  accumulation  must  come  out  of  the  surplus  of  output  over  wage 
payments.  For  simplicity,  assume  there  is  no  relevant  product  at  all 
in  the  backward  sector.  It  still  may  not  be  optimal  for  the  economy  to 
have  full  employment  of  the  labor  force  in  the  advanced  section;  each 
additional  worker  creates  more  product,  on  the  one  hand,  and  a  claim  to 
a  fixed  portion  of  that  product  on  the  other.  Thus  capital  accumulation 
might  be  lower  under  full  employment  than  with  some  unemployment. 

For  simplicity,  it  is  assumed  here  that  the  population  and  available 
labor  force  are  constant  and  that  there  is  no  technological  progress; 
generalization  in  these  directions  can  easily  be  carried  out  by  the 
methods  of  the  last  lecture.  The  following  discussion  is  based  on  the 
work  of  Marglin  [1966]  and  Dixit  [1967].  The  Ramsey  model  is  modified 
by  adding  one  instrument  and  two  constraints.  The  additional  instrument 
is  the  amount  of  labor  to  be  employed,  L;  the  additional  constraints 
are  that  there  is  a  fixed  parameter,  w  (wage  rate  in  terms  of  goods), 
such  that, 

(94)  C  -  wL  -  0, 

and  that  the  amount  of  labor  employed  not  exceed  the  fixed  amount 
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available . 


(95)  L  -  L  =  0. 

Otherwise,  the  Ramsey  conditions  remain: 

(24)  maximize  J**  e"p  t  U[C(t ) ]  dt, 

o 

(28)  K  =  I, 

(66)  F(K,L)  -  C  -  I  =  0; 

(66)  is  substituted  for  (29)  since  the  labor  force  is  a  variable  of  the 
problem;  the  function  F  is  assumed  to  satisfy  (65). 

The  Lagrangian  can  be  written, 

(96)  U(C)  +  pi  +  [F(K,L)  -C  -I]  +  qg  (C-wL)  +  ^  (I^L). 

Equate  to  zero  the  derivatives  of  the  Lagrangian  with  respect  to 

the  three  instruments,  C,  I,  and  L. 

U'(C)  =  qL-  q2,  P  =  q^  FL(K,L)  =  %  w  +  q?, 
or, 

(97)  U«(C)  *  p  -  q2, 

(98)  PFL  *  qgW  + 

where  F^  »  8  F/d  L.  Of  course, 

(99)  q2  ■  0,  qg(C-wL)  =0;  q^  «  0,  q^(^-L;  »  0. 

Since  the  constraint  (66)  is  certainly  effective,  (28)  and  (66) 

imply, 

(100)  K  «  F(K,L)  -  C. 

The  auxiliary  equation,  as  before,  is 

(101)  i/ p  «  p  -  fk(k,l). 

From  (100)  and  (lOl),  at  an  equilibrium, 

(102)  F^L")  -  p,  C®  -  F(K*,L"). 

To  be  an  equilibrium  of  this  system,  however,  (94)  must  be  satisfied. 
Since  F  is  homogeneous  of  degree  1,  it  is  easy  to  prove  that  is 
homogeneous  of  degree  0;  the  first  equation  in  (102)  can  therefore  be 
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solved  for  K*/L*.  Write  the  second  equation  as, 

c* /C  -  f(k7l°°,i), 

since  F(1C,L)  is  homogeneous  of  degree  1.  We  will  assume  then  that, 

(103)  C*/ L*  >  w. 

Then  the  constraint  (9b)  is  not  binding  at  equilibrium,  and 
qg**  ■  0.  Then,  ffoo  (96) >  q^"  >  0,  so  that  (95)  is  binding,  i.e., 

tbeie  is  full  employment .  Thus,  for  K  in  the  neighborhood  of  K°°, 
the  optimal  path  is  Identical  with  that  for  the  Ramsey  problem.  Since, 
in  the  Ramsey  problem,  p  is  a  decreasing  function  of  K,  and  therefore 
C  is  an  Increasing  function  of  K,  it  follows  that  the  constraint  (9*0 
is  fulfilled  emd  ineffective  for  K  «  K*.  It  follows  that  there  is 
X  <  K^*  such  that  the  optimal  solution  for  the  dual  economy  coincides 
with  the  Ramsey  solution  in  the  interval  <  K  +  »  >,  which  will  be 
termed  interval  I. 

K  is  defined  by  the  condition  that  (9b)  becomes  effective  there. 
Since  p(K)  is  the  same  as  for  the  Ramsey  solution  for  K  ■  K,  it  is 
new  known  for  K  ■  K.  Also,  q^(^)  ■  p(K)  FL(l?,r)  >0.  As  K  decreases 
below  K,  it  must  be  that  remains  positive,  at  least  for  some  inter¬ 
val,  while  «jg  rises  above  C.  Then  constraints  (9b)  and  (95)  are  both 
effective  in  an  interval  to  the  left  of  K  —  termed  interval  II,  in  which 
C  •  wl,  K  -  ?(K,Z)  -  wL,  so  that,  from  (101), 

(103)  dj/dK  -  p(p  -  FK(K,Dl/lF(K,I)  -  wl)  in  interval  II. 

Since  p(?)  is  known,  this  equation  can  be  solved  rather  easily  for 
smaller  values  of  K. 

Also  In  interval  II,  qg  ■  p  -  U*(wl),  from  (97),  so  that,  from  (96), 
(10b)  q5  -  p(FL(K,D  -  w)  ♦  wU'(wt). 

Thus  the  lower  end  of  interval  II  is  defined  by  the  condition  q^  »  O. 
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so  that. 


Since  F(0,L)  =  0,  all  L,  by  (65),  there  exists 

-  wl. 

As  K  approaches  +  0,  the  denominator  of  (103)  is  asymptotically 
equivalent  to  (K  -  Kj.),  so  that  clearly  ?(K)  approaches 

infinity.  Also,  from  Euler's  theorem  on  homogeneous  functions, 
vL  =  F(Kj,t)  =  FJK^I)  L  +  F^K^L)  Kx  >  FjK^L)  L, 
so  that  F^(K^,L)  <  w.  The  first  term  of  (id*)  then  approaches  -  *, 
while  the  second  is  constant.  Hence,  q^(K)  *  0  for  some  K  >  0. 

Interval  III  is  the  interval  <0,  K>.  In  this  interval,  the  full 
employment  condition,  (95),  ceases  to  be  binding,  and  q^  *  0.  From 
(9*0,  (97),  and  (98),  we  deduce, 

(105)  U'(vL)  .  p  :fl  -  (FL(K,l)/vj)  . 

which  defines  L  as  a  function  of  K  and  p.  The  basic  differential 
equation  takes  the  form  in  interval  III, 

(106)  dv/dK  -  p[p  -  fk(v.,l)1/[f(k,l)  -  wL], 

It  is  to  be  noted  thst  dl/dK  >  0  in  this  interval  (the  more 
capital,  the  more  labor  can  be  employed) .  This  means  that,  as  ve  push 
the  solution  to  lower  values  of  X,  the  full  employment  constraint  will 
never  become  binding  again.  To  see  that  dl/dX  >  0  in  interval  III, 
first  note,  from  (105)  that 

1-  [Fl(K,L)/v]  >  0  in  interval  III, 

Differentiate  (10$)  totally  with  respect  to.  K  and  group  terms. 

(107)  tU"(wL)  w  ♦  (j/v)  Fjj/K,!)]  (dl/dK)  -  -  (j/v)  F^L) 

Hi  -  (FL(K,L)/w))(d|^dK). 

From  the  concavity  of  U  and  F,  it  follows  thst  U"  <  0,  FTT  <  0. 

LL 
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Since  is  homogeneous  of  degree  0, 

Fu  1  +  fu  K  ■  °- 

by  Euler's  theorem;  but  since  F^L  <  0,  and  L,  K  >  0,  F^  >  0. 

It  is  then  easy  to  calculate,  from  (107), 

(108)  if  dp/dK  <  0,  then  dl/dK  >  0  in  interval  III. 

Since  K  <  K*0,  F^(K,L)  >  F^(K  ,L)  =  p,  so  that  p'(K)  <  0.  Suppose 
p*(K#)  *  0,  for  some  K*,  0  <  K*  <  K.  Take  the  largest  such.  Then 
p'(K)  <  0,  K*  <  K  *  K,  so  that  F^(K,L)  >  p  in  that  interval,  or, 

K/L  <  K7L  for  K*  <  K  =  K, 

while  K*/L*  -  K*/L  •  Since  F^  >  0,  F^  increases  with  K  for  fixed 
L;  but  since  F^  is  a  function  of  K/L,  Fl  increases  with  K/L.  Hence, 
FL(K*,L*)  >  FL(K,L)  for  K  in  a  right-hand  neighborhood  of  K*, 
where,  it  will  be  recalled,  L  is  a  function  of  K  defined  by  (105), 
and  L*  is  its  value  at  K  *  K*.  Therefore, 
dFj/dK  -  0  at  K  «  K*. 

&it,  dFj/dK  *»  Fj^  (dl/dK)  +  Fjjj.  Compute  dl/dK  from  (107),  and  recall 
that  di/dK  -  0  at  K  «  K*.  Then, 

dFL/<tt  -  F^  (K*,L*)  U"(vL*)  w/[U"(vL*)w  +  (p/w^K*,!*)  ]  >  0  at  K=K*, 
a  contradiction.  Hence,  p'(K)  <  0  for  0  <  K  <  K;  by  (108)  L  is  an 
increasing  function  of  K  in  interval  III  (capital  permits  employment), 
and  consumption  is  proportional  to  L. 
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Analysis  of  a  One  Good  Model  of  Economic  Development 

1.  Introduction 

In  these  notes  we  are  going  to  analyze  an  idealized, 
or  better,  an  imaginary  economy  in  which  there  is  only 
one  good.  This  good  can  be  used  for  two  purposes; 

(A)  it  can  be  consumed,  thus  creating  satisfaction  or 
utility  for  the  people  who  consume  it,  or  (B)  it  can 
be  Invested,  in  which  case  it  creates  additional  amounts 
of  itself.  We  will  be  concerned  with  an  operation  of 
this  economy  throughout  time  and  therefore  the  problem 
at  each  instant  will  be  to  decide  how  much  to  consume 
and  how  much  to  Invest  in  order  to  maximize  utility 
throughout  time  in  some  suitably  defined  sense. 

What  is  the  purpose  in  considering  this  sort  of 
imaginary  situation  which  bears  little  resemblance  to 
any  actual  economy,  living  or  dead?  The  answer  is  that 
in  analyzing  this  model  we  shall  run  into  certain 
mathematical  and  economic  techniques  which  turn  out  to 
be  basic  not  only  for  the  study  of  this  make-believe 
economy  but  also  for  the  more  realistic  {but  more 
complicated)  models  which  may  come  up  in  practice.  Our 
aim  is  thus  to  Isolate  this  technique  in  a  simple  context. 
The  technique  we  refer  to  is  what  economists  describe  as 


the  use  of  a  price  system  and  what  mathematicians  refer 
to  as  the  method  of  dual  variables .  By  whatever  name 
one  calls  it,  this  subject  is  the  central  one  both  in 
economic  analysis  and  modern  optimization  theory. 
Mathematically  it  enables  one  to  answer  such  questions 
about  optimal  development  programs  as:  do  they  exist? 

Are  they  unique?  What  are  their  qualitative  properties? 
Economically  it  allows  one  to  give  a  competative  market 
interpretation  to  these  optimal  paths  along  which,  it 
turns  out,  producers  are  maximizing  profits  and  consumers 
are  maximizing  utility  subject  to  their  budgetary 
limitations. 

The  above  is  a  rough  preview  of  what  will  be  found 
in  the  rest  of  these  notes.  The  overture  is  now  ended 
and  the  show  will  begin. 

2.  The  Model;  Finite  Time  Horizon 

The  model  will  involve  a  single  commodity  which  we 
will  refer  to  as  "goods".  It  is  described  by  two  functions, 
a  Production  function  ffc(x)  and  a  utility  function  ut(c) 
where  i‘ ( x )  is  the  amount  of  goods  produced  at  time 
t+1  from  an  investment  of  x  units  of  goods  at  time  t, 
and  ut(c)  is  the  satisfaction  gained  by  consuming  c 
units  of  goods  at  time  t.  The  domain  of  t  is  the 
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non-negative  integers  and  that  of  x  and  c  the 
non-negative  reals. 

DEFINITION  1.  A  program  with  initial  stocks  s  Is  a 
sequence  of  pairs  <  xt,ct  >  finite  or  infinite  such 
that 

(2.1)  cQ  =  s  -  xQ 

(2.2)  ct  =  ft.1(xt_1)  —  xt  for  t  >  0. 

If  <  x^,ct  >  is  a  program  the  corresponding 

utility  sequence  is  given  by  <  ut(ct)  >. 

Clearly,  conditions  (l)  and  (2)  state  that  the  sum 

of  consumption  and  investment  in  period  t  is  equal  to 

the  amount  produced  in  the  previous  period.  If  the 

sequence  <  xt,ct  >  is  finite  with  t  =  1,*--,T  then 

it  is  called  a  T-perlod  program  and  if  fT(xT)  =  s’  we 

refer  to  the  program  as  a  T-period  program  with  initial 

stocks  s,  and  final  stocks  s*  or,  more  briefly,  a 

T-perlod  program  from  £  to.  s^..  The  value  of  such  a 
T 

program  is  2  u.(cj. 

t=0  z  z 

DEFINITION  2.  A  T-period  program  from  s  to  s’  is 
called  optimal  if  it  has  maximum  value  among  all  such 
programs . 

Although  our  principal  interest  will  be  in  infinite 
rather  than  finite  programs  it  will  be  necessary  first 


4 

to  develop  the  basic  properties  of  finite  optimal 
programs . 

V/e  now  introduce  the  central  concept  of  these  notes. 

DEFINITION  J.  The  program  <  xt,cfc  >  is  called 
competitive  if  there  exist  non-negative  numbers  (prices) 

Pt  such  that 

(A)  ut(c)  —  p^c  is  maximized  at  cfc  for  all  t, 

(B)  “  Ptx  is  maxirnized  at  for  a1^  t • 

These  conditions  have  an  important  economic  Interpre¬ 
tation.  Regarding  pt  as  prices  we  see  that  pfcx  is  the 
cost  of  investing  x  units  at  time  t,  while  Pt+ift^x^ 
is  the  return  or  value  of  ft(x)  units  at  time  t+1. 

The  difference,  therefore,  represents  profit  and  condition 
(B)  requires  that  Investment  be  chosen  at  each  time  t 
so  as  to  maximize  profits. 

To  motivate  condition  (A)  we  note  from  (2.2)  that 

ptct  =  Pt^t-l^t-l*  “ 

and  the  right  hand  side  here  might  be  thought  of  as 
disposable  income  since  it  represents  the  value  of  goods 
just  produced  minus  the  cost  of  goods  to  be  invested. 

If  we  then  require  consumers  to  spend  no  more  than  the 
amount  pfcct  (budget  constraint)  condition  (A)  says 
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that  consumers  will  then  consume  so  as  to  maximize  their 
utility  subject  to  this  constraint. 

The  following  simple  result  is  the  starting  point 
for  the  theory. 


THEOREM  1.  If  <  xt,ct  >  ls  a  T"Period  Program  from 
s  to  s'  which  is  competitive,  then  it  is  optimal. 

Proof.  Let  (Pt)>  t  =  1, -‘-jT+l  be  the  competitive 
prices  and  let  <  x^c^  >  be  any  other  program  from  s 
to  s'.  Then  from  (A)  and  (2.1)  and  (2.2) 

u0(Cq)-u(c0)  <  P0(c^-c0)  =  pq(s-Xq)  -p0(s-x0)  *  “P0xo  +  P0xo» 

u^ (c t )  —u ( c t )  ^  Pt(ct-ct)  **  P^ ( ^xt~l ^ *"xt ^ "*Pt ^t “1  ^t“l ^ ~xt ^ 
for  t  =  1, •  •  »,T 

and  0  =  pT+1(s,-s>)  =  PT+1fT(xT)  -  p^^tx^J,), 

and  summing  on  t  gives 
T  T 

.^L^ut^ct^~ut^ct^  =  .2  ^Pt+lft^xt^“ptxt^“^pt+lft^xt^"ptxt^ 
t=Q  t*0 

where  we  have  collected  terms  in  x^..  But  since  each 
term  in  the  sum  on  the  right  hand  side  above  is  non- 
positive  from  (B),  it  follows  that 


2  ut(c^)  “2ut(ct)  <  0 
so  £ut(ct)  ls  a  rnaximuin  as  asserted. 


e 


I 


What  we  have  here  shown  is  that  competitive  programs 
are  optimal.  We  need  a  converse  to  this  theorem  and  for 
this  purpose  must  make  some  assumptions  about  the 
functions  f  and  u.  These  are 

(I)  The  function  f  is  non-negative,  concave 
and  increasing  in  x  (for  each  t)  and  ft(0)  *  0. 

(II)  The  function  u  is  concave  and  increasing 
In  c,  but  possibly  u^(0)  =  - 

The  last  condition  of  (13)  is  important  for  we  would 
like  to  permit  functions  such  as  u(c)  =  log  c,  c  >  0. 

The  condition  u(0)  =  -  <»  would  mean  that  to  consume 
nothing  (starvation)  is  "infinitely  bad".  Unless 
otherwise  stated  it  will  be  assumed  henceforth  that 
conditions  (I)  and  (II)  are  satisfied. 

We  now  recall  the  fundamental  mathematical  result 
needed  for  this  work  (which  may  well  be  the  fundamental 
result  of  all  optimization  theory),  namely  the  Kuhn-Tucker 
Theorem.  We  can  get  by  with  the  following  weak  form: 

Kuhn-Tucker  Theorem.  Let  u(x)  and  f^x),  i  -  l,---,m, 
be  convex  functions  defined  on  a  convex  set  X  and  let 
x  minimize  u(x)  in  X  subject  to 


(2.3) 


ft(x)  <0,  i  -  1, • • • ,m. 


Then  if  (2.3)  has  a  strict  solution  there  exist  numbers 


>  0  such  that 
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(2.4)  u(x)  —  Sp^f^x)  is  minimized  at  x. 

Suggestion  for  a  Do-It-Yourself  Proof:  Let  Y  be 
the  set  of  all  y  =  (y^, ...,ym)  such  that  the  inequalities 

^(x)  i  yi 

have  a  solution.  Show  that  Y  is  convex  and  has  0  as 

an  interior  point  (here  we  use  the  strict  solution 

hypothesis).  Now  let  u(y)  «  min  u(x)  and  show  that 

*i(x)<yi 

U  is  a  convex  function  of  y.  Then  use 

the  fact  that  a  convex  function  <P  has  a  support  at 
every  interior  point  x  of  its  domain,  i.e.  there  is 
a  linear  function  p*x  3uch  that  p*(x-x)  <  <p(x)  —  <p(x) 
for  all  x  in  X.  The  support  of  (i  at  0  is  the  p 
we  are  looking  for. 

We  can  now  get  the  desired  converse  for  Theorem  1. 

THEOREM  2.  Let  <  xt'ct  *  be  an  optimal  program  from 
s  to  s'  and  assume 

(2.5)  ct  >  0  for  at  least  one  t. 

Then  <  xt,ct  >  is  competitive. 

Repiar^ .  Without  (2.5)  the  Theorem  would  not  be 
true.  Suppose  ft(x)  ■  px  for  some  fixed  p  (i.e.  f 
is  linear)  and  suppose  u(c)  «  log  c,  s  -  1,  s'  ■  p***. 
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Then  clearly  the  only  program  from  s  to  s’  is 
<  xt'ct  >  “  <  p^,0  >,  but  condition  (A)  requires  there 
exist  pt  such  that 

log  c  —  ptc  =  max  at  c  =  0 

and  clearly  no  such  exist.  The  fact  that  log  0  =  -  °° 

is  not  crucial  here.  The  same  situation  would  occur  for 
u(c)  *  */c  .  The  difficulty  comes  from  the  fact  that  the 
slope  of  u  is  infinite  at  c  =  0. 

Proof .  Replace  conditions  (2.1)  and  (2.2)  by 

c0  +  x0  “  s  i  0 

(2.6)  ct  +  xfc  -  ft-l^xt-l^  =  0  t  -  1,  •  •  *,T 

s 1  frp(xiji )  <  0 

Now  clearly  <  xt,ct  >  satisfies  (2.6)  and  it  also 
maximizes  2  ut(ct),  since  each  ufc  is  non-decreasing 
in  c.  Hence  the  Kuhn-Tuclcer  Theorem  applies  provided 
we  can  show  that  (2.6)  has  a  strict  solution.  Assuming 
this  for  the  moment  we  obtain  numbers  pfc  >  0,  t  ■  0, . . . , 

T  +  1  such  that 

T  T 

( 2  *  ? )  ^  ut  ^ c t ) ”p0  ^  c0+x0  ^ ” J S.  pt  ^  c t+xt ^ "ft -1 ^ x t -1 ^ ^+pT+lf T  ^XT ^ 
is  maximized  at  <  xt,ct  >.  Rearranging  (2.7)  gives 
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T  T 

^n^ut ~  ptCt^  +  ^pt+lft^Xt^  ~  PtXt^ 

u”  v/ 

is  maximized  at  <  ct,xt  >,  but  note  that  the  terms  of 
(2.8)  are  independent .  hence  (2.8)  is  maximized  at 
<  xt, ct  >  if  and  only  if  ut(ct)  —  pfcct  is  maximized 

at  and  Pt+lft^xt^  “  ptxt  maximizeci  at  xt 
for  all  t,  and  these  are  precisely  conditions  (A) 

and  (B) . 

To  show  that  (2.6)  has  a  strict  solution  we  consider 
the  new  program  <  xt,0  >  and  note  that  we  have 

Xq  —  s  <  0 

(2.9)  *t  -  i  0 

S*  *“  fip(X|ji)  <  0 

and  at  least  one  of  the  above  inequalities  is  strict  by 
assumption  (2.5).  We  therefore  reduce  the  problem  to 
the  following: 

LEMMA  1.  If  (2.9)  has  a  solution  with  one  strict 
inequality  then  it  has  a  strict  solution. 

Proof.  Induction  on  T.  If  T  ■  0  we  have 

x0  -  »  <  0 
S'  -  r(x0)  <  0 

If  x0  -  8  <  0  then  by  slightly  increasing  xQ  if 
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necesoary  we  can  assure  that  s*  —  f(xQ)  <  0  also,  since 
f  is  increasing.  If  s'  —  f(xQ)  <  0  then  by  slightly 
decreasing  x^  we  can  assure  xQ  —  s  <  0  as  well. 

Now  suppose  one  of  the  inequalities  (2.9)  is 
strict  for  some  tQ  >  0.  Then  by  induction  hypothesis 
there  is  a  solution  x^.  giving  strict  inequality  for 
all  but  the  first  inequality,  and  to  get  a  strict 
solution  we  slightly  decrease  x^  if  necessary.  In 
the  other  case  we  have  xQ  —  s  <  0  so  inductively  there 
is  a  solution  xj.  satisfying  all  but  the  last  inequality 
strictly  and  this  will  be  satisfied  too  by  a  slight 
increase  in  xJJ,  . 

The  fact  that  the  class  of  optimal  and  competitive 
programs  are  identical  is  of  economic  interest  in  itself 
as  it  shows  that  if  the  "prices  are  right"  optimality 
is  attained  by  allowing  producers  and  consumers  to  act 
purely  selfishly  and  maximize  profits  and  utility 
respectively.  We  shall  now  show  how  the  price  theorem 
can  be  used  to  gain  qualitative  information  about  the 
nature  of  optimal  programs.  For  the  rest  of  this 
section  we  will  assume  that  the  functions  f  and  u 
are  Independent  of  the  time. 

DEFINITION  4.  The  function  f  will  be  called  productive 
for  any  x  >0,  h  >  0,  f(x+h)  >  f(x)  +  h.  In  words. 
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increasing  the  input  by  some  amount  will  increase  the 
output  by  more  than  that  amount.  For  f  differentiable 
this  is  equivalent  to  f ’ (x)  >  1. 

THEOREM  3.  If  f  is  productive  then  for  any  competitive 
program  <  *tjCt;pt  > 

(a)  prices  pt  are  positive  and  decreasing  in  t. 

(b)  consumption  (and  hence  utility)  is 

non-decreasing  in  t. 

(c)  stocks  xt  are  non -decreasing  up  to  some  time 
tQ  and  decreasing  thereafter. 

Proof.  (a)  V/e  first  note  from  (A)  that  ct 
maximizes  u(c)  -  p^c.  This  shows  that  pt  >  0  since 
otherwise  u,  being  increasing,  would  have  no  maximum. 
Next,  from  (B) 

pt+if(xt>  ~  ptxt  i  pt+if(x)  ~  ptx  for  *n  *  i  ° 
pt  f(x)  -  f(xt) 

or  : - — *-  for  all  x  >  x. 

pt+l  *  *  xt  1 

but  since  f  is  productive  the  right  hand  side  above 

is  greater  than  1,  hence  pt+1  <  pfc. 

(b)  From  (A) 

u('t)  -u(«t+1>  i  Pt(ct  _<W 

u(ct+1)  -U(ct)  i  Pt+X(ct+1  -ct) 
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hence 


0 1  (pt  ->W(ct  -<W 

(this  relation  is  sometines  called  La  Chatelier's 
principle,  I  think).  But  from  (a)  pfc  “  Pt+^  >  0  hence 
cfc  —  ct+^  <  0  as  asserted. 

(c)  It  will  suffice  to  show  that  if  <  x.  .  then 


xt+i  *  *t-  Now 


xt+i  •*»  ■  f(xt>  -f<xt-i>  ■ct)  i  r;Tt>  •f(xt-i)  from  'b) 


Prom  (B) 


Ptf(xt)  -  <  Ptf*xt-r  “  pt-lxfc-l 

BO  f(xt)  -  ftx,.^)  <  (pt./pt)(xt  -X,-.!)  <  0 

so  xt+l  -  xt  <  °- 


3-  Infinite  Programs. 

The  finite  horizon  programs  are  not  of  great  interest 
in  economic  development.  It  is  true  that  if  one  were 
devising,  say  a  five  year  plan  and  had  decided  on  the 
final  stocks  s'  then  it  would  be  natural  to  try  to 
solve  the  problem  of  the  previous  section.  However,  the 
important  decision  would  in  this  caee  already  have  been 
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made,  namely  the  choice  of  s’ .  The  main  problem  in 
economic  planning  is  to  set  reasonable  goals  for 
capital  accumulation  and  it  appears  that  the  only  way  to 
attack  this  is  to  consider  infinite  programs.  The  first 
thing  needed  is  a  notion  of  optimality. 

DEFINITION  5.  If  <  xt,ct  >  and  <  xj.,cj.  >  are 
infinite  programs  we  say  that  <  xt,ct  >  overtakes 
<  >  if  there  exists  a  time  T  such  that 

T 1  T  * 

>  u+ic.)  >  7  u t-(cl)  for  all  T'  >  T. 

t-0  t*0  z  z 

We  say  that  <  xfc,ct  >  m  jfes.  <  x^,c£  > 

(at  infinity)  if 

T 

11m  >  u(c.J  -  u(cj  >  0. 

T'**  t-0  z 

A  program  will  be  called  9.EV&&1  (slCfiD&lX  gBSlflftl) 
if  it  catches  up  to  (overtakes)  every  other  program. 

We  remark  that  if  it  should  happen  that  the  series 

00 

E  u..{c. )  converge  for  all  programs  (as  may  occur,  for 
t-0  z  c 

instance  if  future  utilities  are  suitably  discounted) 
then  Definition  5  corresponds  to  choosing  as  the  optimal 
l^ograra  the  one  whose  utility  sum  Is  greatest.  Just  as 
in  the  finite  case.  However,  Definition  5  la  more  general 
for,  as  we  shall  see,  optimal  programs  in  this  broader 
aenae  may  exist  although  all  the  utility  series  are 
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divergent.  In  the  next  section  we  shell  give  a  specific 
rather  general  existence  theorem  for  the  case  when  u 
and  f  are  independent  of  the  time.  In  the  present 
section  we  shall  obtain  infinite  analogues  tc  Theorems 
1  and  2  relating  optimal  and  competitive  programs.  Note 
in  this  connection  that  the  definition  of  a  competitive 
program  requires  no  modification  for  the  infinite  case 
since  ccnditions  (A)  and  (B)  carry  over  as  given. 

THEOREM  4.  Any  optimal  program  <  xt,cfc  >  is  competitive. 

Proof.  We  first  dispose  of  a  trivial  case  in 
which  ct  =  0  for  all  but  a  finite  number  of  times  t. 

This  means  that  all  stocks  are  completely  consumed  by 
the  end  of  T  time  periods  for  some  T,  so  we  are 
back  in  the  finite  case  of  Theorem  2  where  the 
final  stocks  s'  are  zero. 

In  all  other  cases  c^.  >  0  for  infinitely  many  t. 

Note  that  if  ve  truncate  the  program  at  t  =  T  we  have 
an  optimal  T-period  program  (with  final  stocks  fT(xT) ), 
so  for  each  T  there  exist  prices  ptT  which  satisfy 
(A)  and  (B)  for  t  ^  T.  Denoting  by  the  set  of  all 

such  prices  one  verifies  that  is  a  closed  interval, 

possibly  unbounded  above,  of  non-negative  numbers.  Also 
^t  ^t  s*nce  if  (A)  and  (B)  are  satisfied  for 
t  <  T+l  they  are  satisfied  for  t  <  T,  so  it  remains 


to  show  IL  =  r\  lE  is  non-empty,  and  this  will  follow 
Z  T=1  Z 

from  the  nested  interval  theorem  if  we  can  show  that  for 

T 

any  t  there  exists  T  such  that  nt  is  bounded.  We 

first  note  that  is  bounded  if  c,p  >  0,  for  from  (A) 

T  T 

if  P-r  e  %  then 

m  t 

uT(cT)  PipC,p  )  PfpCip/2 

01*  Pip  £  2(Urj,(Cm)  Crp/2 )  )/Cip  • 

Now  for  any  t  from  (B) 

pt+lftW)  “  ptxt  =  0  (since  ft^0^  “  °) 

s°  Pt  <  (ft^xt^xt^Pt+l  * 

Letting  we  have 

pt  i  qtqt+l’  *  ^T-l15? 

and  this  establishes  the  desired  bound,  and  shows  the 
existence  of  the  competitive  prices. 

We  would  like  now  to  establish  some  sort  of 
analogue  of  Theorem  1  asserting  that  competitive  programs 
are  optimal,  but  since  we  do  not  have  the  concept  of 
final  stocks  some  additional  condition  will  be  required. 
Before  continuing  we  consider  a  concrete  example. 
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EXAMPLE  1.  Let  u(c)  =  -  ,  f  (x)  =  px,  s  =  1, 

where  p  is  some  positive  constant. 

?£S.PQS,itlon  x.  The  sequence  <  x.  ,c.  >  is  a  program 

00  U  t 

if  and  only  if  £  c  ,/pfc  <  1. 

t=0  r 

Proof.  We  have 

C0  =  1  “  x0 
ct  c  p* t-i  “  H  ■ 

Multiplying  the  equation  by  l/p^  and  summing  gives 

t?o  °t//pt  “  1  ~  xt/pT  • 

T 

Conversely ,  let  q,  =  2  c/p*  and  let  xT  -  pT(l-qT) . 

t—  0 

Then  xQ  =  l-c0  and  x^px^  =  pT(l-qT)-ppT"1(l-qT_1)  = 

P  (Qt_<jT-1^  “  CTJ  30  <  xt,ct  >  is  a  Pr°gram. 

Proposition  2.  Competitive  programs  exist  if  and  only 
if  p  >  1. 

Proof.  Let  pfc  be  competitive  prices.  Then  from  (A) 
u{c)  —  Ptc  =  -l/c  +  ptc  is  maximized  at 
thence  u'(ct)  «=  pt  -  l/ct2  or 

ct  "  l/VPt 


(3.1) 
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and  hence  ct  and  xfc  are  positive  for  all  t.  Prom  (B) 
(pt+ip“Pt)x  is  maximized  at  xfc,  so  p^.+1  a  P t/pi  hence 
(3-2)  pt  -  PQ/pt  • 

Letting  a  »  4~p  we  have  from  (3.1)  ct  =  0^/4 p^ 

and  c./p  =  l/(^p7  cr).  The  series  2  c./p  will  converge 
T  u  t=0 

if  and  only  if  p  >  0,  in  which  case 

(3-3)  I  ct/pt  =  o//p^  (ff-1), 

t=o 

so  <  xt,ct  >  is  competitive  if  and  only  if  cfc  »  l//p£  O* 
where 

P0  £  (tf/<7-l)2  • 

Proposition  3.  The  optimal  program  is  <  xt,ct  > 
where  c^.  ■ 

If  there  is  any  optimal  program  it  must  be  competitive 
by  Theorem  4  and  clearly  the  best  of  the  competitive 
programs  (3*1)  is  the  one  for  which  pQ  ■  (a/o-1)  . 

However,  we  can  prove  directly  that  this  program  is 
optimal.  Let  <  ct,xt  >  be  any  other  program.  Prom 
(A)  and  (3*2)  we  have 

u(ct)  -  (p0/pt)ct  ^  u(ct)  -  (P c/Pt)®t  or 


18 


(3.4)  u(cfc)  -  u(ct)  i  p0(ct/pt  —  c^/p^)  . 

Further  If  for  some  t  ct  ^  then  (3*4)  Is  strict. 
Summing  on  t  gives 

T  T  . 

(3. 5)  Su(ct)  -u(ct)  <  pQ  ^  {ct/pt  -  ct/pty) , 

t=l  t=l 

but  from  Proposition  1  and  the  fact  that  2  c./p  =  1, 

t=l  r 

it  follows  that  the  right  hand  side  of  (3*5)  converges 
to  some  non-positive  number  and  hence  the  left  hand 
side  must  eventually  become  and  remain  negative,  proving 
the  asserted  optimality. 

We  now  prove  a  converse  of  Theorem  4. 


THEOREM  5*  If  <  x^,ct;pt  >  is  competitive 


and 


(3.6) 

then  <  xt,ct  >  is  optimal. 


lim  Ptxt  =  0 


Proof .  Let  <  xt,ct  >  be  any  program  from  s 
and  let  denote  the  profit  from  this  program  at 
prices  pt  in  period  t;  that  is 

wt  “  ptf^xt-l^  ~  pt-lxt-l'  “  ptf^xt-l^  “  pt-lxt-l‘ 
From  (A)  we  have 


V.  (3-7) 


u(o0)  -u(c0)  <  P0(o0-J0)  -  P0(s-x0)  -P0(»-*0)  -  'P0(*o"*o) 
u(ct)-u(ot)  <  pt(ct-ct)  -  PjlftXj.jJ-XjJ-pjtftij.j)-*,.),  t  >  1, 
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so 

T  T  T 

(3-8)  Eu(ct)-  Su(ct)  <  2  (*t“*t)  +Pt^t  “  Pt*T  * 
t— 0  t ""  0  t  — “  1 

Prom  (B)  rrt  <  7 so  the  sum  on  the  right  is  non-positive 
and  since  p^x.p  ->  0  it  follows  that  the  entire  right 
hand  side  becomes  less  than  any  preassigned  positive 
number  for  T  sufficiently  large,  which  is  the  definition 
of  optimality. 

Corollary.  If  u  is  strictly  concave  then  <  x^c^.  > 
is  strongly  optimal. 

Proof.  In  this  case  if  ct  ft  ct  for  some  t  then 
the  corresponding  inequality  of  (J.7)  becomes  strict 
and  the  argument  above  shows  that  (5.8)  becomes  negative. 

We  now  give  an  important  equivalent  interpretation 
to  condition  (5*6). 

We  define  pQs  to  be  the  initial  wealth  of  the 
economy.  We  define 
T 

WT  «  pQs  +  2  the  accumulated  wealth  up  to  period  T 

t*l 
T 

et  -  2  PtV  the  -SXPSflfllture  SR  consumption  up  to  period  T 
t=*0 

PTxT  «  value  of  stocks  in  period  T. 


Then  we  have  the  following  obvious  identity 


t 
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Pi r^T  =  ^r£  -^ip 

i.U 

which  is  obtained  by  multiplying  the  t~  equation  of 
(2.1),  (2.2)  by  pt  and  adding. 

In  particular  we  have 

(3-9)  %  <  WT 

which  is  an  obvious  budget  inequality,  stating  that 
expenditure  on  consumption  cannot  exceed  accumulated 
wealth.  Condition  (3.6)  now  becomes 

(3-10)  lim  (VJT  -  Bn)  s  0 

T-*»  1  A 

so  that  "at  infinity"  all  wealth  has  been  used  up  in 
consumption. 

The  condition  seems  like  a  reasonable  one  for 
optimality.  However,  it  is  not  a  necessary  condition. 
One  can  show  that  for  cases  in  which  the  function  f  is 
not  productive,  so  that  eventually  f(x)  <  x,  then 
P^Xq,  converges  to  some  positive  value  rather  than  to 
zero. 

We  call  a  program  efficient  if  it  satisfies  (3.10). 
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4.  The  Time  Independent  Case 

In  this  section  we  confine  ourselves  to  the  case 
where  f  and  u  are  independent  of  time.  We  need  one 
more  assumption  which  is  a  strengthening  of  Definition  4. 

DEFINITION  5-  The  function  f  is  strongly  productive 
if  there  is  a  constant  p  >  1  such  that 

f (x  +  h)  >  f(x)  +  ph. 

For  f  differentiable  this  is  equivalent  to  f'(x)  >  p. 

EXISTENCE  THEOREM.  If  f  is  strongly  productive  there 
exists  an  optimal  program  if  and  only  if  u  is  bounded. 

This  theorem  was  originally  proved  by  D.  McFadden 
for  the  case  of  f  a  linear  function.  We  first  prove 
the  necessity  of  the  boundedness  condition. 

LEMMA  2.  If  <  xt,ctjpt  >  is  competitive  then 

pt  i  p</pt- 

Proof.  From  (B) 

“  Vt  i  pt+if<x)  “  ptx  for  a11  x  i 0 

°r  pt+i^f^xt^  ~  f^x^  -  pt^xt 

or  pt+i/pt  i  i  p  for  x  >  xt 
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from  which  the  result  follows. 

LEMMA  3.  If  <  >  is  competitive  then  pt 

approaches  0  and  xt  and  ct  approach  »  monotonically . 

Proof .  The  first  assertion  follows  from  the 
previous  Lemma.  Suppose  (ct)  were  bounded.  Then 
there  would  exist  c  such  that  c-t  <  c  and 
u(c)  —  u(ct)  >  6  >  0  for  all  t.  But  from  (A) 

pt(c  —  ct)  >  u(c)  —  u(ct)  >  6  for  all  t 

and  we  have  seen  that  the  left  hand  side  above  approaches 
zero,  giving  a  contradiction.  Since  ct  becomes  infinite 
so  does  xt  and  monotoneity  follows  from  Theorem  3. 

THEOREM  6.  If  there  is  an  optimal  program  then  u 
must  be  bounded. 

Proof .  Let  <  xt»ct  >  be  an  optimal,  hence 
competitive,  program.  From  (A) 

u(c1)  -u{cQ)  <  P0(c1-cQ)  -  p^fU^-x^  -  p0(s-x0) 

u^ct+l^“u^ct^  »  pt^ct+l~ct^  “  pt^xt)“xt+l^"pt^xt-l^~xt^  * 
Summing  from  t  -  1  to  T 

u(cT+l)'u(oO)  i  PT(xT‘xT+l)+  “ 

t*  X 

-  +  »0(f(x0)  '  s) 
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but,  the  first  term  above  Is  non-positive  by  Lemma  5 

and  the  terms  in  the  summation  are  non-positive  from  (B). 

Hence 

u(cT+i)  i  u(Cq)  +  P0(f<x0'  ”  s)' 

so  u(ct)  is  bounded  for  all  t,  but  since  c^.  ■*  ® 
this  means  that  u  is  bounded. 

If  u  is  bounded  we  establish  the  existence  of  an 
optimal  program  by  taking  the  limit  as  T  •+  »  of  T-pericd 
programs,  as  follows: 

Let  PT  ■  <  x^,c^; p^  >  be  a  T-period  program  which 
maximizes  £  u(cfc)  (the  final  stoclcs  this  case 

t— C  m  |J> 

are  zero).  Now  for  a  fixed  t,  the  3ets  Ix^)  and  (c t } 
are  bounded  for  all  T.  If  in  addition  we  knew  that 
{p^}  was  bounded,  then  a  standard  "diagonal  process" 
argument  would  establish  the  existence  of  a  competitive 
program  P,  a  point -wise  limit  of  the  programs  P  .  Our 
procedure  will  be  first  to  prove  the  boundedness  of  (p£) 
and  then  to  show  that  P  is  efficient  and  hence  optimal, 
by  Theorem  5* 

We  first  need  a  fundamental  inequality. 


LEMMA  4.  If  <  xt,ct;pt  >  is  competitive 


then 


£  <  P*  ct  +  pMct)  -  u(cT  )]/(p-l) 

t-T,  c  c  1 
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41 

it 


Proof.  Prom  (A) 

T-l 

u(cT)  -u(cT  )  -  X  ^u^ct+l^ 

1  *1 
T-l 

*  2  (pt  ~  Pt+i)ct+i  + 
ll 

-  £  (i  -  pt+i/pt>ptct 

so  from  Lemma  2 

u(cT)  -u(cT  )  <  (1  -  J*)  2  ptct 
1  T1 
and  we  obtain  (#)  by  rearranging. 

We  need  a  simple  property  of  bounded  functions. 

DEFINITION  6.  The  member  p  is  a  support  of  the 

w 

function  u  at  the  point  c  if  u(c)  —  u{c)  <  p  (c-c) 

*  C 

for  all  c.  (If  u  is  differentiable  then  p„  ■  u'(c). 

Note  that  pt  is  a  support  of  u  at  in  any  competitive 

program. ) 

LEMMA  5*  If  u  is  bounded  and  p  is  a  support  of 

v 

u  at  c  then  11m  pc  •  C. 

G 

Q+m  w 

Pyop.t.  Let  u  ■  sup  u(c)  and  choose  c  so  that 

c£0 

u(c)  ^ii-  c/2,  hence  u(c)  -  u(c)  $  c/2  for  all  c  £  0. 
Then 


T-l 

s  £  pt<ct+i 


ct> 


PtcT  ~  PT1CT1 


+  ptct  Pt1ct1 


I^n  c«r 
A1 
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e/2  >  u { c )  -  u(c)  >  pc(c  -  c )  =  Pcc(l  -  c/c) 

so  if  c  >  2c  then  p  c  <  e. 

c  = 

COROLLARY.  The  set  of  numbers  pc  is  bounded. 

c 

We  now  get  a  first  economic  application. 

THEOREM  7.  If  u  is  bounded  there  exists  a  number  M 
such  that  for  any  competitive  program  <  xt,et;pt  >  the 
quantity  ET  *  £  ^tct  =  M  for  T* 

Proof.  Apply  {*)  with  T^  «  0  to  get 

&T  <  PqCq  +  p/p-l[u(cT)  ~u(c0)) 

but  the  right  hand  side  is  bounded  by  hypothesis  and 
the  preceding  Corollary. 

COROLLARY.  If  <  xt,ct;pt  >  is  an  infinite  competitive 

m 

program  then  £  Pfc*  converges. 

t-o  *  c 

T  T 

LEMMA  6.  For  the  programs  P  the  prices  p->  satisfy 

pj  <  M/s  . 
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T 

and  since  tr.  >  0  the  result  follows, 
t  » 

T 

COROLLARY.  The  prices  satisfy 

T  .  M/  T 
pt  <  M/sp 

Proof.  Lemma  2 . 

THEOREM  8.  There  exists  an  infinite  competitive  program. 


Proof.  Take  the  point -wise  limit  of  the  programs 
T 

P  and  call  this  limit  P.  It  is  a  standard  exercise 

to  verify  that  P  is  a  competitive  program. 

To  complete  the  existence  theorem  we  must  prove 

00 

that  P  is  efficient.  Let  E  *  £  pLc,  which  exists 

t=0  z  z 

00 

by  the  Corollary  to  Theorem  J.  Let  VJ  =  pns  +  2  ir  . 

v  t-1 

We  must  show  that  this  expression  converges  and  that 
E  =  W. 

T 

It  will  be  convenient  to  consider  the  program  P 
to  be  infinite  with  the  convention  that  for  t  >  T 

*t  ■  ct  *  °- 
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c.  •+  «  (Lemma  J>)  hence  uCc*.)  •*  p.  «  sup  u(c)  and 

c>0 

Ptct  -*■  0  since  pt  is  a  support  of  u  at  cfc  (Lemma  5) 

then  we  can  choose  t_  so  that  p,  c.  <  e/2  and 

e  = 

p/ p~l  ( M-  “  u(cT  ))  <  e/2  for  all  T.  Now  apply  (*)  and 

we  have 
T 

£  P^ct  i  pt  ct  ~  u^ct  ^  i  €  for  a11  T* 

fc=t€  6  €  t€-l  e  t€-l 

Finally,  by  (3-9),  £  <  PqS  +  £  ^  so 

fc= 0  t*—  2. 

2  ^  <  £  p*e*  =  f  <  e. 

t=t  ^  t=t  ^  t-t  Z  ^  t=tg  U  b  “ 

THEOREM  9.  The  program  P  is  efficient. 

Proof .  Let  ET  =  E  PfC?  and  let  WT  *  p^s  +  E  ir? 

t— o  1  1  u  t=»l  1 

00 

and  let  E  =  Ep^c,.  Now  for  T' ,T  >  t_  it  follows 
t=0u  € 

rpi  m  T*  T 

from  Lemma  7  that  E  —  E  <  e  and  V/  —  W  <  e, 

T  T 

so  (E  )  and  (w  )  are  Cauchy  sequences  and  converge 

T  T 

to  their  point -wise  limits  E  and  W.  But  E  ■  V/ 
for  all  T,  hence  E  »  U,  completing  the  proof. 
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APPENDIX.  The  Case  of  More  than  One  Good. 

In  these  notes  the  entire  analysis  of  optimal 
programs  has  been  based  on  the  use  of  competitive  prices, 
and  the  existence  of  these  prices  therefore  played  a  key 
role.  To  establish  their  existence  we  were  at  some  pains 
in  the  proofs  of  Theorems  4  and  8  to  obtain  an  a  priori 
bound  on  the  values  of  prices  for  finite  horizon  programs. 
This  boundedness  requirement  is  no  mere  mathematical 
technicality  but  is  quite  essential  to  the  understanding 
of  the  models.  We  will  here  illustrate  this  further  by 
considering  a  very  simple  two  good  model  in  which  there 
is  an  obvious  optimal  program  which,  however,  is  not 
competitive. 

The  model  involves  both  a  production  good  P  and 
a  consumption  good  Q  ,  and  there  is  a  single  joint 
process  for  producing  both.  Namely  from  x  units  P 
invested  in  period  t  one  obtains  px  units  of  P  and 
x  units  of  Q  in  period  t  +  1.  Assuming  initial  stock 
of  P  is  1  a  program  <  xt,ct  >  must  satisfy 

xQ  £  l,  xt  £  P*t-1  and  ct  ■  xt  for  a11  fc* 

The  Inequalities  here  simply  have  the  meaning  that 
one  can  throw  away  either  production  or  consumption  goods. 

Now,  it  is  perfectly  clear  that  by  any  reasonable 
definition  of  optimality  the  only  optimal  program  is 
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f 

xt  »  ct  «  p  since  any  other  program  involves  needless 
throwing  away.  It  also  follows  from  the  Kuhn-Tucker 
Theorem  that  every  T-period  optimal  program  is  competitive 
for  any  utility  function  u.  However 

THEOREM  10.  If  p  >  1  and  the  utility  function  u  is 
unbounded  then  the  optimal  program  is  not  competitive. 

Proof.  We  must  first  write  down  the  competitive 

conditions.  Let  pfe  and  be  the  prices  of  P  and  Q 

in  period  t.  Condition  (A)  then  remains 

(A1)  u(c)  —  q^c  is  maximized  at  ct  , 

and  the  profit  condition  (B)  at  time  t  is  clearly 

(B* )  q^x  +  Pt+1Px  —  Ptx  is  maximized  at  xfc. 

Suppose  now  that  <  p^p^  >  is  optimal.  Then  from 
(B* )  we  must  have 

(1)  Qt  ■  <Pt  -  PPt+1) 

and  from  (A1 ) 

u(pt)  -u(pt+1)  >  qt(pt  -  pt+1)  -  (l-p)pt(pt  -  PPt+1). 
Summing  from  t  ■  0  to  T-l  gives 

u(  1)  -  u(pT)  £  (1  -  p)(p0  -  p^)  or 
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(p-l)p0  >  u(pT)  -u(l)  +  (p-l)pTpT  >  u(pT)  -u(l) 

but  if  u  is  unbounded  this  is  impossible  since 
have  to  be  infinite. 


for  all  T 
pQ  would 
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Optimal  Control  and  Convex  Programming1 
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INTRODUCTION 

The  problems  arising  in  optimal  control  theory  are  similar  mathematically 
to  those  met  in  the  calculus  of  variations,  with  additional  requirements 
in  the  form  of  inequality  constraints  which  must  be  satisfied.  The  subject 
received  its  initial  impetus  from  problems  arising  in  the  area  of  guidance 
and  control,  and  the  basic  results  of  Pontryagin  ct  at.  (1962)  are  developed 
from  this  point  of  view,  as  is  much  of  the  subsequent  work  on  this  subject 
(Leitmann,  1962).  However,  as  emphasised  by  Bellman,  Glicksberg,  and 
Gross  (1958),  a  continuous  spectrum  of  problems  encountered  by  systems 
analysts,  operations  researchers,  economists,  and  management  consultants 
in  various  phases  of  industrial,  scientific,  and  military  activity  can  be 
included  in  an  appropriate  formulation  of  control  theory.  Two  such 
potentially  important  applications  are  dynamic  economic  models  (Usawa, 
2964)  and  long-range  capital  investment  studies. 

As  a  greatly  simplified  example  of  the  latter  application,  suppose  that 
the  control  u(t)  is  the  rate  of  investment  at  time  t  The  state  of  the  system 
x(<)  is  described  by  the  quantity  of  the  t'th  product  x,(f)  produced  by 
time  f.  The  x,(f)  are  determined  for  any  given  v(t)  by  the  system  of  dif¬ 
ferential  equations 

i  -  f(x,  u,  /),  x(0)  *  x#,  t *  JO,  Tj. 

The  rate  of  investment  is,  of  course,  nonnegative  and  also  may  not  exceed  a 
specified  upper  bound,  so  that  0  <,  w (t)  £  a.  Furthermore,  it  is  required  that 
the  production  schedule  satisfy  the  state  constraints  pt(t)  x,(t)  $  f ,(<), 


1  Sponsored  in  pvt  by  NASA  grant  NsO  065  and  In  part  by  tbs  Mstb—s rics 
Bewreb  Cnotnr,  United  SUtas  Army,  Mndteoa,  Wisoontea  under  Contract  No. 
DA-1 1-022-ORD-30M. 
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where  the  p<(<)  and  q{(t)  are  specified,  and  that  this  be  done  so  as  to 
minimise  the  total  discounted  investment 

pfu]  -  jf  dt 

over  a  finite  time  T.  Because  of  the  presence  of  the  state  constraints, 
this  problem  is  of  a  typ*  which  is  difficult  both  theoretically  and  computa¬ 
tionally  (see  Berkonts,  1962,  and  Pontryagin  et  al.,  1962,  chap.  6).  In 
actual  practice  the  investment  decisions  would  not  be  made  continuously 
but  rather  at  discrete  intervals,  say,  once  a  month.  This  is  typical  of  a 
dynamic  process  which  can  be  formulated  as  continuous  but  which  is 
more  usefully  considered  as  discrete,  since  this  gives  both  a  more  realistic 
model  and  a  computational  method  of  solution. 

The  two  important  questions  to  be  answered  are: 

1.  Will  any  admissible  (0  <  u(t)  <  a)  investment  program  satisfy  the 
production  constraints?  That  is,  does  an  admissible  control  exist? 

2.  If  there  are  admissible  controls,  how  do  we  find  one  which  is  optimal? 

The  remainder  of  this  paper  is  devoted  to  answering  these  two  questions 
for  a  general  class  of  discrete  optimal  control  problems. 

8ome  of  the  material  here  is  based  on  parts  of  an  earlier  report  (Rosen, 
1964).  The  author  has  also  had  the  benefit  of  several  discussions  with 
J.  Abadie,  whose  work  in  this  area  has  been  most  stimulating. 

DiecaiTK  PROBLEM  WITH  STATB  CONSTRAINTS 

It  will  be  useful  to  give  a  further  motivation  for  the  approach  taken 
here  for  the  solution  of  optimal  control  problems.  Such  problems  fall 
naturally  into  two  classes  depending  on  their  initial  formulation,  namely, 
continuous  and  discrete.  In  general,  we  will  solve  the  continuous  problems 
on  a  digital  computer;  this  will  require  the  numerical  integration  of  systems 
of  differential  equations— in  fact,  a  discrete  approximation  to  the  contin¬ 
uous  process.  We  may  therefore  assume,  at  least  for  computational  pur¬ 
poses,  that  we  will  always  be  dealing  with  discrete  problems. 

To  be  specific,  we  will  consider  a  discrete  problem  as  follows:  Let  x<  t  iT 
represent  the  state  vector  at  time  l<  (t  *  0,  1,  •••  ,  m)  and  «  S'  the 
corresponding  oontrol  vector  for »  -  0,  I,  •••,*»-  1.  The  initial  value 
St  is  specified,  and  we  wish  to  determine  the  vectors  x<  and  «<  so  as  to 
minimise 


*(*«,  *0, 


who*  the  x,  and  w,  must  satisfy  the  recursion  relation 


(1) 
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z<+l  ~  Xi  -  /(*,,  u,),  *  -  0, 1,  •  •  •  ,  m  -  1,  (2) 

where  each  u,  must  be  selected  from  a  convex,  compact  subset  U  Q  S' 
and  where  xm  must  lie  in  a  convex,  compact  subset  Y„  C  E9.  We  acmme 
that  o(x,  u)  is  a  function  from  E9  X  U  to  El  with  <r  t  Cl  on  E9  X  V  and 
that  j{x,  u )  is  a  function  from  E9  X  U  to  E9  with  /  c  C‘  on  E9  X  U.  We 
asoime,  further,  that  the  sets  V  and  Xm  are  each  specified  by  a  system 
of  inequality  constraints,  that  is, 


Xm  “  \x  |  q(x)  <  0} 

(3) 

u  -  ju  |  h(u)  <  0} , 

(4) 

where  g(x)  is  a  function  from  E9  to  Ek  with  g  t  Cl  and  convex  on  E9  and 
where  h(u )  is  a  function  from  ET  to  El  with  h  t  Cl  and  convex  on  S’. 
The  sets  Xm  and  U  are  assumed  to  be  nonempty;  by  the  convexity  of 
g(x)  and  h(u),  they  are  convex. 

We  may  think  of  this  discrete  problem  as  arising  from  a  finite  difference 
approximation  to  the  continuous  problem 

min  /  e(x(t),u(t))  dt.  (5) 

Jo 

where 

«),  /*{0,  T], 

x(0)  *  x0,  x(T) « Xm,  (8) 

v(l)  tV,  t'\0,T). 

The  sum  (1)  is  the  simplest  approximation  to  the  integral  (5)  with 
At  -  T/m,  t ,  *  tAl,  and  e  -  Ale.  The  recursion  relation  (2)  is  the  simplest 
finite  difference  approximation  to  the  differential  equation  (6)  with  /  <*  Ai/. 

We  may  now  consider  the  discrete  problem  as  the  minimisation  of  a 
convex  function  on  a  finite-dimensional  Euclidean  space  subject  to  the 
equality  constraints  (2)  and  the  inequality  constraints  (3)  and  (4).  For 
problems  of  this  type,  the  appropriate  theory  is  that  developed  by  Kuhn 
and  Tucker  (1951);  see  also  Karlin  (1959)  and  (1963).  For  our 
purposes,  the  most  convenient  statement  of  this  theory  is  essentially  that 
given  by  Berge. 

Wc  let  #  »  m(n  +  r)  and  denote  by  t  •  E*  the  vector  t’  -  (x(,  • 

*4,  •  •  •  ,  where  un primed  vectors  are  column  vectors  and  where 
the  prime  denotes  transpose.  We  wilt  call  t  an  odmumble  point  if  the 
x<  (t  »  !,•••,  m)  satisfy  (2),  rm  t  Xm,  sod  u,  t  V  (»  -  0, 1,  •  •  *  ,  s»  -  1). 
Suppose  we  have  an  admissible  point  s*t  determined  by  x*, »  •  1,  •  •  • 


« 
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u%  i  -  0,  •  •  •  ,  m  —  1.  We  will  denote  by  0,(z£)  the  k  X  n  Jacobian 
matrix  of  g(x)  evaluated  at  x*  and  by  K(v*{)  the  J  X  r  Jacobian  matrix 
of  h(u)  evaluated  at  We  will  also  denote  by  0„(zj)  the  matrix  in  which 
we  have  replaced  by  zeros  the  jth  row  of  g,(x*)  if  the  jth  element  of  g(x*)  <  0. 
Thus,  we  have  g'(x*)0.(x*)  **  0.  The  matrix  A»(u^)  is  defined  similarly 
for  i  -  0, 1,  •  •  •  ,  to  —  1,  so  that  V(u^)A«(u*)  =  0  (t  *  0, 1,  •  •  • ,  m  —  1). 
We  also  let  /»(z,  u)  and  /.(z,  u)  denote  the  n  X  n  and  n  X  r  Jacobian 
matrices  of  /. 

An  admissible  direction  2  at  2*  is  given  by  vectors  f<  (t  =  1,  •  •  •  ,  m) 
and  Hi  (i  —  0,  ••  •  ,  m-  1)  such  that 

£<♦1  -  =  f.(xt,  u*j£(  +  /,(zf,  t  -  0,  •  •  •  ,  m  —  1, 

So  -  0 

and 


<  o, 

<0,  t  -  0, 1,  •  •  •  ,  m  -  1. 

It  follows  that  if  y  t  E'  is  not  an  admissible  direction  at  z*  then  it  points 
outward  from  the  set  of  admissible  points  at  z*;  that  is,  z*  +  ay  is  not 
an  admissible  point  for  every  sufficiently  small  a  >  0. 

The  sum  (1)  to  be  minimized  is  given  in  terms  of  z  by  letting 

*(*)  *  ]C  <r(z,,w<).  (7) 

•  -0 

We  will  say  that  an  admissible  point  z*  is  a  relative  minimum  if 

<t>(z*)  <  *(«*  +  cd)  (8) 

for  every  admissible  direction  2  at  z*  and  sufficiently  small  a  >  0. 

We  can  now  state  the  necessary  Kuhn-Tucker  conditions  for  a  relative 
minimum. 

Theorem  1A  :  If  an  admissible  point  z*  is  a  relative  minimum,  then 
there  exist  vectors  X<  t  E*  (t  =  1,  •  •  •  ,  m),  a  vector  vm  >  0,  rm  «  Ek,  and 
vectors  >  0,  ift  t  El  (t  =  0,  1,  •  •  •  ,  m  —  1)  such  that 

vLg{x*)  =  0,  (9) 

v'Mtf)  =  0,  t-0J.---.m-l,  (10) 

and  such  that  the  Lagrangian  function 

$>(*)  -  <b(z)  4-  £  X'+1[xi+1  -  z<  -  /(zi(  «,)]  4-  v'mg(xm)  4-  £  Min,) 

»-0  » -0 


(11) 
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has  a  stationary  point  at  z  -  z*.  that  ist 

*.(«*)  =  o. 


m) 


Proof :  The  proof  is  essentially  that  given  in  Kuhn  and  Tucker  (1951) 
or  Berge  (1963)  and  is  based  on  the  Farkas  lemma. 


Corollary:  At  a  relative  minimum  z*t  the  value  of  the  Lagrangi&n  func¬ 
tion  <t>(z)  and  the  sum  (1)  are  equal.  Furthermore,  the  vectors  A(,  r„, 
and  r)i  must  satisfy  the  following  system  of  equations: 


A<+i  -  A,  =  u*)\i+,  +  al(x*,  u*i),  *  «  1,  •••  ,  m  —  1,  (13) 

A.  =  -gi(x*K,  (14) 


and 

=  K(x*,  u*)Ai0  ~  <r'(x?,  v*),  %  =  0,  1,  •  •  *  ,  m  —  1. 


(15) 


Proof :  Because  of  tho  v. .  iplementary  requirements  (9)  and  (10)  and 
the  fact  that  the  r  '.niasible  pent  z*  satisfies  (2),  we  have 


4>(z*)  =  <j>C*)  =  £  <r(x*u*). 
*  — 0 


The  system  (13)  th  ough  (15)  is  equivalent  to  (12)  and  is  obtained  by 
Si  tting  co  zero  the  partial  derivative  of  4>  with  respect  to  each  component 


J  t  is  clear  from  the  form  of  (13)  that  this  recursion  relation  for  the  A< 
is  closely  related  to  the  usual  adjoint  equation  for  the  continuous  problem. 
The  terminal  value  Xm  for4the*adjoint  vector  is  specified  by  (14). 

In  Theorem  1A,  necessary  conditions  for  a  relative  minimum  were 
given  with  no  conditions  on  a(x,  u )  and  f(xt  u)  other  than  differentiability. 
We  now  show  that  if  c(x,  u)  is  convex  on  FT  X  U  and  if  f(x,  u )  is  linear 
on  FT*  X  U,  then  the  conditions  are  also  sufficient  for  a  global  minimum. 

Theorem  SA :  Let  a(x,  u)  be  convex  and  f(x,  u)  be  linear  on  J 5"  X  U. 
If  z*  is  an  admissible  point  and  there  exist  vectors  A,  and  nonnegative 
vectors  vm  and  ij<  such  that  (9),  (10),  (13),  (14),  and  (15)  are  satisfied, 
then  z*  is  a  global  minimum. 

Proof :  We  will  denote  by  Z  C  E’  the  direct  product  of  the  sets  xt  t  FT 
(*  =  1,  •  •  •  ,  m)  and  u<  t  U  (*  =  0,  1,  •  •  •  ,  m  —  1).  Then  the  function 
4(z)  is  convex  on  Z,  since  each  term  is  convex  on  FT  X  U.  The  first  sum¬ 
mation  in  (11)  is  linear  in  z  and  therefore  also  convex  on  Z.  The  remaining 
two  terms  are  convex  by  assumption  and  by  the  fact  thAt  rm  and  the 
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are  nonnegative,  Therefore,  #(*)  is  convex  on  2.  Now  a  stationary  point 
of  a  convex  function  is  a  global  minimum,  so  that 

*(**)  -•  min  #(*).  (16) 

Ml 

As  above,  we  have  +(z*)  »  #(**).  Furthermore,  for  every  admissible 
point  z,  we  have  from  (2)  through  (4)  that 

*0  <  *0.  (17) 

Then,  from  (16)  and  (17),  $(**)  <  $(:)  for  every  admissible  point  s, 
so  that  z*  is  a  global  minimum. 

By  means  of  a  straightforward  modification,  the  previous  results  can 
be  extended  to  include  the  case  of  constraints  on  the  state  vectors  xt 
(i  «  1,  •  •  •  ,  m  —  1)  in  addition  to  the  constraint  xm  t  Xm.  To  show  this, 
let  us  require  that 

x{zXit  »*=!,•••,  m,  (18) 

where  each  X{  is  a  convex  subset  of  E*,  specified  in  terms  of  convex  func¬ 
tions  g*(x)  from  E*  to  Ekt,  for  i  **  1,  •  •  •  ,  m,  with  gm(x)  —  g{x).  T»e  there¬ 
fore  have 

Xt  »  {x  |  g\x)  <  0},  i  —  1,  •••  ,m.  (19) 

Note  that  Xm  is  identical  to  that  given  by  (3).  An  admissible  point  is 
now  one  which  satisfies  (2),  (18),  and  u<  z  U. 

The  extension  of  Theorem  1A  to  this  state-bounded  problem  is  given  by: 

Theorem  IB  :  If  an  admissible  point  z*  is  a  relative  minimum,  then  there 
exist  vectors  X<  zE*  (i  «  1,  •  •  • ,  m),  vectors  >  0,  p{  tE*'  (*  -  1,  •••,»»), 
and  vectors  >  0,  c  ■fit*  (t  »  0, 1,  •••,?»-  1)  such  that 

i^Ort)  -  0,  i  -  i,  •••  ,  m,  (20) 

0,  »  *  0,  •  •  •  ,  m  —  1,  (21) 

and  such  that  the  Lagrangian  function 

*(z)  -  <K*)  +  £  X'+i[x<+i  -  x(  -  /(*„  u<) ] 

<’°  (22) 

t,  <■:»'(*.)  +  z5i»w 

<-l  (-0 

has  a  stationary  point  at  *  -  z*. 
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Similarly,  the  corollary  to  Theorem  1A  now  becomes: 

Corollary.  At  a  relative  minimum  z*t  we  have 

#(**)  =  4>(z*)  =  2  *(*?.  «*)• 

•  •0 

Furthermore,  the  vectors  X<,  vif  and  >j<  must  satisfy  (14),  (15),  and 
X.+,  -  X,  =  -f'(x*  ut)X,+,  -j-  *£(*?.  ut)  + 

i=l,-,m-l  (23) 

Finally,  the  extension  of  Theorem  2A  gives: 

Theorem  2B:  Let  <r(x,  u)  be  convex  and  /(x,  u)  be  linear  on  FT  X  U. 
If  z*  is  an  admissible  point  and  there  exist  vectors  >  and  nonnegative 
vectors  vi  and  such  that  (20),  (21),  (23),  (14),  and  (15)  are  satisfied, 
then  z*  is  a  global  minimum. 

CONVEX  PROGRAMMING  SOLUTION 

We  are  now  in  a  position  to  consider  the  computational  solution  of  the 
discrete  optimal  control  problem  with  state  constraints.  We  limit  our 
discussion  here  to  problems  for  which  the  optimality  conditions  are  suffi¬ 
cient,  namely,  <r(x,  u)  convex  and  /(x,  u)  linear.  In  the  interest  of  simplicity, 
we  will  also  assume  that  the  constraint  sets  U  and  Xt  are  defined  by  linear 
inequalities,  that  is,  that  the  functions  h(u)  and  g*(x)  are  linear.  The 
method  for  /(x,  u)  linear  discussed  here  is  the  basis  for  a  convergent 
iterative  procedure  for  solving  the  more  general  case  where  /(x,  u)  is 
convex.  This  more  general  case  is  described  in  another  paper  (Roeen,  1966). 

The  general,  variable  coefficient  linear  case  will  be  considered,  that  is, 
a  discrete  approximation  to  the  differential  equation 

t  =  A(t)x  +  B(t)u,  te  [0,7*1.  (24) 

We  will  let  A*  =  A(t{)  and  B{  =  B{tt)  (i  —  0,  •  •  •  ,  m)  and  use  the  finite 
difference  approximation 

•Tt+i  ~  Xi  —  A/[#A,+1x,+i  -f*  (1  ~  0)A,x<]  AtBiUi, 

•  -0,1,  (25) 

where  0  <  0  <  1.  For  0  =  0,  this  gives  the  explicit  (forward)  scheme  (2), 
while  for  0  -  1  it  gives  the  fully  implicit  (backward)  scheme.  The  value 
0  -  i  gives  a  numerically  stable  method  with  minimum  truncation  error. 
The  relation  (25)  may  be  solved  for  x<+i  to  give 

x<*,  -  KiXt  +  Btu(,  i  -  0, 1,  •  •  •  ,  m  —  1,  (26) 
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where 


end 

S,  “  m  [7  "  »  (») 

The  solution  to  the  finite  difference  equation  (26)  is  given  by 

xt  -  Y<x o  +  Y(  £  A ,-+15,u#,  t  -  1,  •  •  •  ,  m,  (29) 

/-o 

where  the  matrices  F<  satisfy  the  homogeneous  equation 

F^.-tf.F,,  F0  =*  /,  <-0.1.  ••  ,m-l,  (30) 

and  where  the  matrices  A<  satisfy  the  homogeneous  adjoint  equation 

A<  -  K[ Am,  F«a:  =  /.  <  -  m  -  1,  •  •  •  ,  1.  (31) 

It  follows  from  (30)  and  (31)  that  F.  AJ  =  F,+lA'+1  =  /,  so  that  A{  —  F71 
(<  —  1,  •  •  •  ,  m).  Furthermore,  the  actual  calculation  of  F.x0  and  of  the 
coefficients  of  the  u,  in  (29)  requires  only  the  inversion  of  an  n  X  n  matrix 
to  get  each  Kt  and  the  multiplication  of  n  X  r  matrices.  These  quantities 
are  therefore  readily  calculated  from  the  specified  values  of  x0,  A(t), 
B(t),  m,  and  6. 

Because  of  the  linearity  of  (29),  we  can  use  these  relations  to  map 
the  original  problem  into  the  control  space,  that  is,  the  product  space 
of  the  u(.  This  reduces  the  original  problem  to  one  of  minimising  a  convex 
function  subject  to  linear  inequality  constraints  in  the  space  ET\  Since 
the  original  problem  involved  s  «  m(n  +  r)  variables,  and  since  r  <t  n 
(often  with  r  =  1  or  r  =*  2),  this  may  effect  a  considerable  reduction  in 
the  number  of  variables.  To  accomplish  this  reduction,  we  replace  each 
xt  in  the  sum  (1)  and  in  the  linear  inequalities  g\xi)  <  0  which  define 
the  Xt  by  the  corresponding  righthand  side  of  (31).  Each  vector  /(x<) 
thus  gives  rise  to  kt  linear  inequalities  on  the  u<.  We  also  have  the  original 
set  of  l  linear  inequalities  A(u,)  <  0,  which  ensures  that  each  u,  t  U. 
We  therefore  have  a  system  of 

ml  4-  2  k4  -  m(l  +  £) 

<-l 

linear  inequalities  which  must  be  satisfied  by  any  admissible  set  of  vectors 
U/.  Because  of  the  way  in  which  these  inequalities  arise,  they  have  a 
special  structure  which  can  be  used  to  advantage.  We  will  represent  the 
inequalities  obtained  from  the  g'(x( )  by 
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H  D',u,  -  p  <  0,  (32) 

1-0 

where  each  Dt  is  an  r  X  ink  matrix  and  where  p  t  E*1.  Because  *<♦»  involves 
only  values  of  u,  for  j  <  i,  the  matrix  D'  =  [Z)£D{  •  •  •  D^.,1  has  a  lower 
triangular  structure.  The  matrices  D,  will  depend  on  the  matrices  A{,  B{, 
and  the  matrices  which  define  the  linear  transformations  g,(xi)}  as  well 
as  on  8  and  m.  The  vector  p  will  also  depend  on  x0,  as  well  as  on  these 
other  quantities.  The  important  point,  however,  is  that  the  matrices  D, 
and  the  vector  p  can  be  explicitly  computed  with  a  reasonable  amount 
of  computation. 

In  order  to  simplify  the  discussion,  we  will  denote  by  w  t  ETT  a  vector 
which  specifies  the  control  for  t  *»  0,  1,  •  •  •  ,  m  —  1,  that  is,  to*  — 
(«o,  u,',  •  •  •  ,  ufm. ,).  Two  subsets  of  Emr  are  then  given  by 

IT,  *  !  £  Dfa  -  p  <  o}  (88) 

and 

Wt  -  {to  I  k(u ,)  <  0,  j  -  0,  •  •  •  ,  m  -  1).  (34) 

Since  it  is  determined  by  linear  inequalities,  Wx  is  closed  and  convex 
if  it  is  not  empty.  Since  Wt  is  the  direct  product  of  compact  convex  sets, 
it  is  compact  and  convex.  Then 

W  ~  Wt  A  Wt  (35) 

is  compact  and  convex  if  it  is  not  empty. 

The  first  important  question  about  the  discrete  problem  can  now  be 
answered:  Does  there  exist  any  admissible  control?  This  is  equivalent 
to  the  question:  Is  W  an  empty  set?  Good  computational  methods  are 
available  for  determining  if  a  solution  to  a  system  of  linear  inequalities 
exists  and,  if  so,  for  finding  such  a  solution.  Since  the  inequalities  of  (33) 
and  (34)  have  the  natural  form  of  constraints  for  a  dual  linear  programming 
problem,  a  dual  simplex  procedure  can  be  used  for  this  purpose  (Dantxig, 
1963).  The  starting  procedure  for  the  gradient  projection  method  (Rosen, 
1960)  is  equivalent  to  this  and  may  conveniently  be  used  for  this  purpose. 
Another  approach  would  be  to  use  the  duality  theory  of  linear  programming 
and  to  consider  the  primal  problem  corresponding  to  the  dual  constraints 
(33)  and  (34)  and  an  arbitrary  linear  dual  objective  function.  This  objective 
function  can  always  be  chosen  so  as  to  give  an  initial  primal  feasible 
solution.  The  duality  theory  then  says  that  if  the  primal  problem  has  a 
finite  maximum,  the  corresponding  dual  solution  is  dual  feasible  (that  is, 
an  admissible  control ).  Any  mu  mMo  linear  programming  code  can  therefore 
be  used  for  this  purpose. 
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Once  we  have  determined  that  an  admissible  control  exists,  and  in 
fact  have  actually  determin  H  «uch  a  control,  we  can  proceed  to  find 
an  optimal  control.  We  do  this  by  once  again  using  (29)  to  eliminate 
the  xt,  this  time  in  the  sum  (1),  to  get  a  function  p(w)  to  be  minimized. 
Since  convexity  is  preserved  by  a  linear  transformation,  the  function  p(to) 
is  convex.  We  have  now  reduced  the  original  discrete  problem  to  that  of 

finding 


p(tc*)  =  min  p( w), 

*«ir 

that  is,  the  minimization  of  a  convex  function  subject  to  linear  inequality 
constraints.  Furthermore,  we  have  an  admissible  control  to0  (determined 
as  discussed  above)  with  which  to  start  the  minimization  procedure.  A 
number  of  computationally  tested  methods  are  available  for  the  solution 
of  such  convex  nonlinear  programming  ^problems  (Rosen,  I960,  and 
Hadley,  1964).  In  the  special  case  where  <r(x,  u)  is  linear  on  F  X  U, 
the  problem  can  be  solved  in  the  dual  form  by  a  dual  simplex  method  or, 
in  its  primal  form,  by  any  primal  simplex  code.  The  possibility  of  formulat¬ 
ing  a  discrete  linear  optimal  control  problem  as  a  linear  programming 
problem  has  been  considered  by  Zadeh  and  Whalen  (1962),  An  efficient 
method  of  solution  for  linear  problems  with  large  values  of  m  has  been 
proposed  by  Dantzig  (1966),  based  on  his  generalized  upper-bounding 
technique. 

Once  the  optimal  control  w *'  =  (uj',  u? •  •  •  ,  u*l,)  has  been  calculated 
in  this  way,  the  optimal  state  vectors  x*  (»  =  1,  •  •  •  ,  m)  are  immediately 
given  by  (29).  The  Lagrange  multipliers  (or  shadow  price  vectors) 
(*  -  1,  •  •  •  ,  m)  and  ij,  (i  =  0,  •  •  •  ,  m  —  1),  corresponding  to  the  state 
and  control  constraints,  are  also  available  as  part  of  the  convex  pro¬ 
gramming  solution.  These  quantities  may  be  of  considerable  interest  since 
they  give  the  rate  of  decrease  in  the  function  value  with  relaxation  of 
each  constraint.  The  influence  of  parameter  changes  on  the  optimal  solution 
can  also  be  obtained  by  use  of  the  parametric  solution  features  of  many 
codes.  Finally,  if  desired,  the  optimal  adjoint  vectors  satisfying  (23) 
with  /(x,  u)  linear  can  be  calculated  from 

X|  -  A<+,  -  <rj(x?,  u*)  -  0)'(x*K,  *  »  m  —  1,  •  •  •  ,  1, 

starting  with  Xm  - 

COMPUTATIONAL  EXAMPLE 

The  previous  discussion  will  now  be  illustrated  by  means  of  a  variable 
coefficient  linear  problem  with  four  state  variables  and  a  scalar  control. 


Optimal  Control  and  Convex  Programming  555 


In  addition  to  bounded  control,  we  also  impose  state  constraints  on  one 
of  the  state  variables.  The  system  considered  is  in  the  form  (24),  with 

TA(t)  -  tAf  +  (T  -  t)A0 

and 


The  control  must  satisfy  |w(t)|  <  1,  so  that  h(u)  =  _  jj-  We  also 

impose  the  terminal  constraints  xt(T )  -  x«(T)  «  0  and  the  state  con¬ 
straint  |x«(t)|  <  0.5  for  0  <  t  <  T.  These  give 


We  wish  to  minimize  the  terminal  Euclidean  norm  ||x(D!|.  This  system 
is  similar  to  a  constant  coefficient  system  for  which  a  numerical  solution 
has  previously  been  obtained  (Ho  and  Brentani,  1963). 

The  optimum  solution  to  this  problem,  using  the  finite  difference 
scheme  (25)  with  8  -  $,  is  shown  in  Figures  1  and  2.  The  values  T  -  2.5 
and  m  »  25  were  used  so  that  At  -  0.1.  The  optimum  control  is  shown 
in  Figure  1,  and  the  trajectory  as  given  by  the  four  state  variables  *,(<<) 
O'  -  1,  •  •  •  ,  4;  t  «  0,  •  •  •  ,  25)  is  shown  in  Figure  2  for  the  case  with  no 
state  constraint  on  xt(t().  The  minimum  value  of  the  objective  function 
attained  is  ||x(T)J|a  ■»  0.008456.  The  optimal  solution  to  the  same  problem 
with  the  state  constraint  |x«(t<)l  <  0.5  is  shown  in  Figures  3  and  4.  The 
distinct  change  in  the  control  required  to  satisfy  the  state  bound  should 
be  noted,  as  well  as  the  increase  in  the  terminal  norm  squared  to  0.026922, 
which  is  due  to  the  fact  that  the  admissible  control  set  IT  is  smaller  because 
of  the  state  bound. 


1  Figure*  l  through  4  appear  on  pager.  fSi~6;  text  resume*  on  page  M98) 


Fiouu  1.  Fourth-order  nonautonomoue  lyetem. 
Optimal  control  with  objective  function  value  0.0086. 


Fiounn  3.  Fourth-order  nonautonomoua  eyatem. 

Optimal  etate  trajectory  with  objective  function  value  0.0086. 
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These  solution*  were  obtained  by  using  a  program  based  on  the  scheme 
described  by  (25)  through  (34).  The  convex  programming  problem  ob¬ 
tained  in  this  way  was  solved  by  using  the  gradient  projection  computer 
program  (SHARE  distribution  /1399).  The  solution  time  required  for 
each  of  these  problems  on  the  ibm  7090  was  approximately  two  minutes. 
The  program  and  its  use  to  obtain  the  optimal  solution  to  a  variety  of 
typical  problems  will  be  described  elsewhere  (Rosen  and  O’Hagan,  1906). 
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DISCUSSION 

H .  Halxin  :  At  the  beginning  of  your  paper,  you  said  that  the  condition 
was  necessary  and  sufficient.  At  that  time  you  did  not  specify  exactly 
the  case  where  this  would  be  true. 

J.  B.  Rosen:  The  question  is:  When  are  the  Kuhn-Tucker  conditions 
sufficient  for  the  discrete  control  problem?  The  answer  is  that  in  general 
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they  are  only  sufficient  for  a  linear  system  of  difference  equations,  that  is, 
when  /(z,  u)  in  (2)  is  linear.  This  is  because  for  sufficiency  the  admissible 
space,  over  which  the  minimization  is  carried  out,  must  be  a  convex  set.  The 
admissible  set  consists  of  those  points  —  (z{,  •••,*«•!*•••,  ul-i), 

which  satisfy  (2),  x.  t  Xm,  and  *  U  (»  *  0,  •  •  •  ,  m  —  1).  This  cannot 
be  convex  if  /(x,  u)  is  nonlinear. 

J.  Moser:  Is  there  any  analysis  which  tells  you  or  gives  you  an  indica¬ 
tion  whether  the  discrete  problem  really  approximates  the  continuous 
problem? 

J.  B.  Rosen:  The  question  of  convergence  of  the  discrete  problem 
optimal  solution  to  an  optimal  solution  of  the  corresponding  continuous 
problem  is  closely  related  to  the  set  of  reachable  points  xm  t  F  given 
by  (29)  with  u,  i  U.  For  specified  values  of  x#,  d(l),  and  B(t),  the  reachable 
set  will  depend  on  m  and  8.  What  we  would  like  is  that  the  reachable 
set  expands  as  At  — >  0  or  m  — »  ® .  It  can  be  shown  that  this  will,  in  fact, 
be  the  case  under  certain  conditions  if  8  is  correctly  chosen  in  (25).  On 
the  other  hand,  simple  ex*  nples  can  be  constructed  by  using  the  explicit 
scheme  (8  —  0),  where  the  reachable  set  shrinks  as  m  — ♦  ®.  In  such  a 
situation,  one  may  find  that,  as  the  grid  size  is  decreased  (m  — ■>  ®),  a 
value  exists  such  that  for  all  larger  m  it  is  no  longer  possible  to  reach 
the  terminal  manifold.  In  such  a  case,  one  clearly  does  not  have  convergence. 

J.  Moser:  I  think  it  would  be  an  interesting  question  to  investigate 
the  conditions  which  would  ensure  convergence. 

H.  Halein  :  There  is  a  paper  by  Professor  Markus  (paper  6)  on  the 
stability  of  solutions  of  optimal  control  problems  with  respect  to  changes 
in  the  data  of  the  problems.  I  think  that  if  a  problem  is  stable  in  Professor 
Markus’  sense,  then  the  solution  of  a  discretisation  of  this  problem  will 
tend  to  the  solution  of  the  problem  itself  as  the  discretisation  is  made 
finer.  Professor  Markus  give*  an  answer  to  such  a  problem  in  the  case 
of  a  linear  system  with  constant  coefficients. 
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ITERATIVE  SOLUTION  OF  NONLINEAR  OPTIMAL  CONTROL 

PROBLEMS* 

J.  3.  ROSENf 

Abstract.  The  solution  of  nonlinear,  state-constrained,  discrete  optimal  control 
problems  by  mathematical  programming  methods  is  described.  The  iterative  solution 
consists  essentially  of  Newton’s  method  with  a  convex  (or  linear)  programming  proL- 
lem  solved  at  each  iteration.  Global  convergence  of  the  iterative  method  is  demon¬ 
strated  provided  a  convexity  and  constraint  set  condition  are  both  satisfied.  The 
computational  solution  of  nonlinear  equation  control  problems  makes  use  of  a  previ¬ 
ously  developed  method  for  state-constrained  linear  equation  problems.  The  solution 
method  for  nonlinear  problems  is  illustrat<  d  by  means  of  two  numerical  examples. 

1.  Introduction.  The  optimal  control  problem  considered  here  is  a  rather 
general  type  of  discrete  problem.  We  wish  to  minimise  a  convex  function  of 
the  state  and  contro1  vectors,  where  the  control  vectors  must  lie  in  a  speci¬ 
fied  convex  set.  In  addition  the  state  vectors  must  also  satisfy  specified 
constraints  at  each  discrete  time,  as  well  as  initial  and  terminal  conditions. 
Furthermore,  the  system  dynamics  may  be  given  by  a  nonlinear  recursion 
relation  provided  that  the  nonlinearity  is  convex  in  an  appropriate  way.  A 
discrete  system  of  the  type  considered  here  may  represent  a  process  which 
is  actually  discrete  (see,  for  example,  [3],  [1]),  or  it  may  be  obtained  from 
a  finite  difference  approximation  to  a  continuous  system  in  which  we  wish 
to  minimize  a  convex  functional.  Such  an  approximation  is  always  required 
when  a  numerical  .migration,  using  a  digital  computer,  is  part  of  the 
solution  process. 

The  purpose  of  this  report  is  to  describe  a  computational  method  for 
solving  this  general  type  of  discrete  problem,  and  to  show  by  means  of  the 
relevant  theorems  that  the  method  will  always  work  when  the  appropriate 
assumptions  are  satisfied.  The  method  is  an  iterative  procedure  that  deter¬ 
mines  a  sequence  of  admissible  trajectories  (state  and  control  vectors 
satisfying  all  constraints);  the  sequence  converging  to  an  admissible  tra¬ 
jectory  that  satisfies  the  necessary  conditions  for  optimality.  The  method 
has  been  used  to  obtain  numerical  solutions  to  several  small  nonlinear  test 
problems.  In  addition  to  showing  that  it  is  not  difficult  to  implement  the 

*  Received  by  the  editors  June  28,  1965.  Presented  at  the  First  International 
Conference  on  Programming  and  Control,  held  at  the  United  States  Air 
Force  Academy,  Colorado,  April  15, 1965. 

t  Computer  Sciences  Department  and  Mathematics  Research  Center,  University 
of  Wisconsin,  Madison,  Wisconsin.  This  research  was  sponsored  in  part  by  the  Na¬ 
tional  Aeronautics  and  Space  Administration  under  Research  Grant  NGR-50-002-028 
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ORD-2059. 


223 


224 


J.  B.  ROSEN 


scheme  described  here,  these  numerical  results  show  that,  at  least  for  the 
test  problems  considered,  the  number  of  iterations  required  is  small. 

In  a  previous  publication  [14]  a  statement  of  the  Kuhn-Tucker  condi¬ 
tions  was  given  for  the  nonlinear  state-constrained  problem  considered 
here.  A  computational  procedure  for  systems  described  by  linear  recursion 
relations  was  also  given  based  on  a  convex  (or  linear)  programming  com¬ 
puter  code.  Numerical  results  described  there  show  that  this  computational 
procedure  is  efficient  for  typical  linear  systems.  The  method  described  in 
the  present  paper  takes  advantage  of  this  efficiency  by  solving  a  sequence 
of  such  linear  problems.  From  this  point  of  view  the  method  of  the  present 
report  may  be  thought  of  as  Newton’s  method  (see,  for  example,  [9])  with 
a  convex  (or  linear)  programming  problem  solved  at  each  iteration.  The 
use  of  various  forms  of  Newton’s  method  for  the  numerical  solution  of  opti¬ 
mal  control  problems  has  been  proposed  in  a  number  of  earlier  publications 
[4],  [6],  [10],  [12].  The  two  important  differences  between  the  method  de¬ 
scribed  here  and  these  earlier  proposals  are  that  (1)  in  the  present  method 
global  convergence  is  assured  when  a  convexity  and  constraint  set  condition 
are  both  satisfied,  and  (2)  large  changes  in  both  the  control  and  state 
vectors  may  take  place  at  each  iteration  until  these  vectors  are  close  to  their 
limiting  values,  thereby  greatly  accelerating  convergence  during  the  early 
states.  The  limiting  convergence  rate  is  quadratic,  as  expected  in  Newton’s 
method. 

Another  way  of  looking  at  this  method  for  nonlinear  problems  is  that  at 
each  iteration  we  get  an  admissible  and  optimal  trajectory  which  satisfies 
a  linear  recursion  relation  which  differs  to  some  extent  from  the  true  non¬ 
linear  recursion  relation.  At  each  iteration  the  amount  by  which  the  lineari¬ 
zation  is  in  error  decreases,  so  that  in  the  limit  the  trajectory  obtained  is 
an  optimal  solution  to  the  linearized  problem  obtained  by  linearizing  about 
the  limiting  trajectory.  Since  it  is  the  recursion  relation  which  is  linearized, 
the  limiting  trajectory  is  the  optimal  solution  to  a  control  problem  described 
by  linear  recursion  relations.  It  therefore  follows  that  for  the  class  of  dis¬ 
crete  nonlinear  problems  considered,  the  optimal  solution  has  the  properties 
of  a  solution  to  a  discrete  problem  with  linear  recursion  relations. 

The  requirement  that  the  state  vectors  satisfy  specified  constraints 
usually  increases  the  difficulty  of  the  optimal  control  problem  (see,  for 
example,  [5]  and  [13,  Chap.  6]).  In  the  approach  used  here  to  solve  the 
state-constrained  discrete  problem,  the  convergence  proof  uses  the  fact 
that  the  state  vector  at  each  discrete  time  belongs  to  a  convex  compact  set. 
In  this  sense  then,  the  liability  of  the  state -constrained  problem  has  now 
become  an  asset.  The  existence  of  state  constraints  also  introduces  a  sym¬ 
metry  into  the  problem,  so  that  the  usual  sharp  distinction  between  the 
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(independent)  control  vectors  and  (dependent)  state  vectors  largely 
disappears. 

The  method  described  here  applies  to  a  recursion  relation  in  the  form  of 
a  system  of  inequalities,  and  might  represent  a  finite  difference  approxima¬ 
tion  to  a  system  of  differential  tnequalttie*.  By  the  use  of  a  modified  objec¬ 
tive  function,  the  problem  usually  considered  corresponding  to  a  system  of 
differential  equations  can  be  handled.  The  “classical'*  two-point  boundary 
value  problem  can  also  be  solved  in  this  fashion  by  allowing  the  control 
vector  to  represent  the  error  in  the  difference  equations  and  minimising 
this  error. 

It  should  be  emphasised  that  while  the  convexity  assumption  is  needed 
in  order  to  prove  convergence,  the  computational  method  can  be  applied 
even  when  this  assumption  is  not  satisfied.  In  many  such  cases  the  iterative 
method  will  still  converge,  and  if  so,  the  trajectory  obtained  will  satisfy  the 
necessary  conditions  for  an  optimal  trajectory.  Furthermore,  at  each  itera¬ 
tion  a  linear  constraint  minimisation  problem  with  either  a  convex  or  linear 
function  is  solved.  Because  of  this,  the  method  will  almost  always  converge 
to  a  trajectory,  which  is  at  least  a  local  minimum  of  the  objective  function, 
rather  than  an  arbitrary  stationary  trajectory.  It  should  also  be  mentioned 
that  the  method  considered  here  requires  only  the  Jacobian  matrix  (first 
partial  derivatives)  of  the  system  equations,  and  does  not  need  the  Hessian 
matrix  (second  partial  derivatives)  as  required  by  some  other  computa¬ 
tional  schemes  [6],  [10],  [12].  For  many  nonlinear  problems  this  may  permit 
a  great  reduction  in  the  computation  required. 

While  the  iterative  method  described  was  developed  for  problems  arising 
in  control  theory,  it  may  also  be  used  to  solve  any  finite-dimensional  con¬ 
strained  minimisation  problem  of  the  general  type  considered.  In  this 
respect  the  method  is  also  a  contribution  to  the  solution  of  nonoonvex 
mathematical  programming  problems. 

2.  Problem  formulation.  The  discrete  optimal  control  problem  we  shall 
consider  here  is  to  determine  m  +  1  state  vectors  x*  t  E*  and  m  oonlrol 
vectors  u*  6  F  which  satisfy  (2.2),  (23)  and  (2.4)  and  such  that 

■—I  m—l 

(2.1 )  g  *(*<*,  O  -  min  <r(*< ,  u<) 

for  all  vectors  z<  and  u<  that  satisfy  the  recursion  relation 

(2.2)  z<+i  ~  “  /(*< » u<),  *  m  0, 1,  *  •  ,  wi  “  1, 

with 

(2.3)  «<  €  U,  C  F, 


*  -  0,  l,  •  •  ,  m  -  l, 
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and 

(2.4)  Xi  €  Xi  C  En,  i  -  0, 1,  ••• , m. 

The  subsets  Xi  and  C/<  are  assumed  to  be  compact  and  convex.  We  assume 
that  9  is  a  convex  function  from  each  direct  product  Xi  X  Ui  to  El.  We 
also  assume  that  /  is  a  function  from  each  Xt  X  Ui  to  E".  An  additional 
assumption  on  the  differentiability  and  convexity  of  the  components  of  / 
will  be  needed  later.  It  should  be  mentioned  that  the  results  obtained 
actually  hold  (with  obvious  modification)  for  the  more  general  case  where 
0  and/ may  depend  explicitly  on  the  index  i.  When  the  discrete  problem  is 
obtained  from  a  continuous  problem,  this  corresponds  to  the  explicit 
dependence  of  <r  and/ on  time.  However,  in  order  to  avoid  the  complication 
of  additional  subscripts  we  will  limit  consideration  to  the  simpler  case. 

A  discrete  problem  of  this  type  may  arise  directly,  or  it  may  arise  as  a 
finite  difference  approximation  to  a  continuous  system.  For  example, 
suppose  that  in  the  original  continuous  system  we  wish  to  determine  a 
control  w(0  with  range  U(t)  for  each  t  6  [0,  T],  and  a  trajectory  x(f) 
with  range  X(t)  for  each  t  €  [0,  T],  such  that  the  functional 

(2.5)  j £  u(t))  dt 

is  minimised,  and  x(t)  and  u(t)  satisfy  the  system  of  differential  equations 

(2.6)  x  -  /(*,  u),  t  £  [0,  T]. 

The  sum  (2.1)  then  represents  the  simplest  approximation  to  the  integral 
(2.5),  and  the  recursion  relation  (2.2)  the  simplest  finite  difference  approxi¬ 
mation  to  the  system  (2.6),  if  we  let  Ai  -  T/m,  9  -  Al  9,  and  /  -  At/. 
The  form  of  (2.2)  may  be  retained  even  when  more  sophisticated  finite 
difference  schemes  are  used  to  approximate  (2.6),  but  the  relationship 
between  /  and  J  will  become  more  complicated.  The  use  of  a  more  accurate 
implicit  finite  difference  scheme  when  /  is  linear  has  been  considered  in 
(14).  It  should  be  emphasised  that  in  this  paper  we  solve  the  discrete 
problem  for  a  fixed  value  of  m,  and  that  we  are  interested  in  convergence 
(for  fixed  m)  to  an  exact  solution  of  the  nonlinear  discrete  problem.  The 
convergence  to  the  solution  of  the  continuous  problem  as  m  -*  *  will  not 
be  considered  here. 

In  order  to  show  convergence  of  the  iterative  procedure  we  will  oonsider 
the  discrete  system  (2.1),  (2.3)  and  (2.4),  with  (2.2)  replaced  by  the 
system  of  inequalities 

(2.7)  *<+»  -  Xi  $  /(x< ,  Ui),  i  -  0, 1,  •••  ,  m  -  1. 

Such  a  system  of  inequalities  may  arise  as  a  discrete  approximation  to  a 
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system  of  differential  inequalities  of  the  form  x  £  J(x,  u).  On  the  other 
hand,  if  one  really  wants  to  solve  (2.2),  this  is  accomplished  by  obtaining 
an  optimum  solution  to  (2.7)  wiih  an  appropriately  modified  objective 
function,  as  discussed  at  the  end  of  this  section. 

In  order  to  simplify  notation  we  proceed  as  in  (14],  and  denote 
a  specific  control  (u<>  ,ui /,---,um_l)  and  corresponding  trajectory 
(xq  ,*/,•••,  xj)  by  a  single  vector  26  E1,  where  a  **  m(r  4-  n)  4-  n. 
Thus,  a  solution  to  the  discrete  system  if  specified  by  the  vector 

(2.8)  z  -  (io'  ,*/  ,•••,*.',  tio'  ,«/,•••  ,  t4-i). 

We  will  also  denote  by  Z  C  E*  the  direct  product  of  the  sets  X ,  and  (/< ,  so 
that 

(2.9)  Z-flXt  xTl  Ut . 

t-0  «-0 


Since  the  sets  X,  and  (/,  are  convex  and  compact,  Z  is  also  convex  and 
compact.  We  can  now  represent  the  objective  by  means  of  the  function 

*— 1 

(2.10)  <Mz)  -  £*(*,-,  Ui). 

1—0 

It  follows  from  our  assumption  concerning  o  that  4(2)  is  convex  on  Z. 
Finally  we  represent  the  l  *  mn  equations  (2.2)  or  inequalities  (2.7)  by 
means  of  a  function  v(z)  from  £*  to  El.  We  let 

V*,i  m  f>(Xi ,  Mi)  4  Xt.y  j  , 

(2.11) 

1  -  0, 1,  •  •  •  ,  m  -  1,  ;  -  1,  •  •  •  ,  n. 

The  equations  (2.2)  are  then  given  by  v(z)  -  0,  and  the  inequalities 
(2.7)  by  v(z)  £  0.  In  this  notation  we  can  restate  our  problem  (2.1), 
(2.3),  (2.4)  and  (2.7)  as  follows: 

(2.12)  4(2*)  -  min  (4(2)  \z  6  Z,  e(s)  £  0}. 

§ 

Some  remarks  on  the  nature  of  the  admissible  set 

S  -  {2  1 2  6  Z,  i?(x)  2  0} 

are  in  order  here.  The  set  Z  is  by  assumption  convex  and  oompect,  and  in 
fact  will  usually  be  a  polyhedral  set  in  E *.  The  admissible  set  oorrer  iidin* 
to  the  original  discrete  problem  (2.2),  (2.3)  and  (2.4)  is  given  by 

5j  «  jr  |  2  6  Z,  0(2)  •  0|. 

The  sei  is  convex  only  if  v(z)  is  linear  in  2,  that  is,  f(x,  u)  is  linear  in  x 
and  u.  If  one  or  more  components  of  /  are  nonlinear  in  x  or  u,  the  set  S,  is 
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nonconvex.  For  a  general  nonlinear  function  /(x,  u),  the  set  £  is  also  non¬ 
convex.  The  iterative  procedure  of  the  following  sections  can  be  applied  to 
such  problems  and  will,  in  fact,  often  converge.  However,  there  is  no 
guarantee  in  the  case  of  a  general  nonlinear  /  that  the  procedure  will 
always  converge.  In  order  to  prove  convergence  we  require  that  each 
component  of  v(z)  be  a  convex  function.  It  should  be  emphasized  that 
this  is  not  the  requirement  which  makes  £  a  convex  set  (except  in  the 
limiting  case  where  v(z)  is  linear).  The  set  £  is  convex  if  each  component 
of  v(z)  is  a  concave  function.  Thus  the  convergence  argument  holds  for  the 
minimization  of  a  convex  function  over  a  certain  kind  of  nonconvex  region. 

If  we  actually  want  to  satisfy  (2.2)  we  must  obtain  a  solution  to  the 
problem  ^(z*)  =  minl€fll  $(z);  that  is,  we  require  v(z*)  **  0.  In  order  to 
achieve  this  and  still  solve  a  problem  in  the  form  of  (2.12)  we  let 

(2.13)  <Kz)  "  ♦(*)  + 

••i 

where  a  is  a  sufficiently  large  positive  constant.  Since  each  component 
Vij  is  a  convex  function,  4>{z)  is  a  convex  function.  We  then  solve 
min,ca$(z),  which  is  in  the  form  of  (2.12).  It  is  shown  in  the  Appendix 
that  provided  the  constraint  set  £  satisfies  a  certain  condition  (essentially 
the  same  condition  which  insures  convergence)  there  will  always  exist  a 
value  of  a  such  that  any  local  minimum  of  $(z)  for  z  £  5  is  also  a  local 
minimum  of  <t>(z)  for  z  £  £|  . 

We  are  now  able  to  describe  the  iterative  method  for  solving  the  discrete 
optimal  control  problem  in  terms  of  the  (in  general,  nonconvex)  mathe¬ 
matical  programming  problem  (2.12). 

8.  Linearized  problem.  Let  Z  be  a  compact  convex  subset  of  E%  and  v(z) 
be  a  function  from  Z  to  E1  with  v  6  Ca(Z).  We  assume  that  for  some 
Zwe  have  v(z°)  >  0  and  define  a  subset  of  E‘  by 

(3.1)  £  =  {z  |z  €  Z,  v{z)  £  0J . 

Since  *°  €  £,  the  set  £  is  not  empty.  Also  since  £  is  a  closed  subset  of  Z 
it  is  compact  but,  in  general,  not  convex  (see  Fig.  1). 

If  we  let  v,(y)  be  the  l  X  s  Jacobian  matrix  of  v  evaluated  at  z  *  y,  we 
can  define  for  each  fixed  y  6  Z  the  linear  function  on  Z, 

(3.2)  to(z,  y)  *  v(y)  +  c,(y)[z  -  yj. 

For  each  y  £  Z  we  obtain  a  subset  of  E*  given  by 

(3.3)  W(y)  -  |z  (  w(z ,  y)  £  0|. 
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Now  we  consider  the  point-to-set  mapping 

(3.4)  r  :Z-*Z, 
given  by 

(3.5)  ry-W(y)nZ. 

This  is  illustrated  in  Fig.  2. 

Theorem  1.  The  aet  Ty  is  compact  and  convex.  Furthermore,  if  each  com- 
pune.nt  of  v(t)  it  convex  on  Z,  then  for  each  y  €  S, 

(3f>)  y  €  Ty  C  5. 

Proof.  For  each  y,  the  set  W(y)  is  the  intersection  of  l  half  spaces,  a 
closed  convex  set.  Therefore  the  intersection  of  JF(y)  and  the  compact 
convex  set  Z  is  compact  and  convex.  Next  we  note  that  since  y  6  S, 

(3.7)  w(v,  y)  ■  p(y)  2  0, 

so  that  y  (.  W'(y).  Then  since  y  f  Z,  we  have  y  6  Ty. 

Furthermore,  by  the  convexity  of  p(f),  we  have  for  any  (y,  *)  6  SXS, 

(3.8)  v(z)  }z  i»(y)  -F  e,(y)[«  -  y)  -  tc(r,  y). 

Then  for  each  z  (  W'(y), 

(W)  v(t)  £  w(t,  y)  £  0, 

so  that  for  every  z  €  Wr(y)  f)  Z  we  have  if  5,  or  Fy  C  S. 
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Directly  from  (3.6)  we  get  the  following. 

Corollary.  Tj/  map*  S  onto  S. 

The  constraints  for  the  problem  have  now  been  defined  in  terms  of 
the  convex  subset  Z  and  the  function  v(z).  The  objective  function  is  given 
by  a  function  $(x)  from  Z  to  El  which  is  continuous  and  convex  on  Z. 
The  iterative  procedure,  starting  with  an  initial  point  .y°  £  S  can  now  be 
stated  in  a  concise  form.  A  sequence  |y'|  is  obtained  which  satisfies 

(3.10)  *(y#+I)  -  min  *(z),  >  »  0, 1,  •• 

*«r*> 

Such  a  sequence  is  obtained  by  solving  a  well  behaved  convex  eonatnined 
minimisation  problem  with  i  £  IV,  to  get  the  minimum  4(y,+l)  at  a 
point  y**1  £  Ty *.  The  convexity  of  the  subset  Ty’  and  the  function  *(z) 
insure  that  a  global  minimum  of  $(*)  for  z  £  IV  is  attained  at  t  «  y'*\ 
Suppose  that  the  sequence  jy*}  converges  to  a  limit  point  y*.  We  would 
like  to  be  able  to  state  that  the  point  y*  is  the  optimum  solution  to  the 
partially  linearized  problem  obtained  by  linearizing  the  constraints 
i'(*)  £  0,  about  *  *«  y*.  That  is,  we  want 

(3.11)  #(y*)  -  min#U). 

•(  r»* 

In  terms  of  the  original  discrete  optimal  control  problem  (2.1),  (2.3), 
(2.4)  and  (2.7),  this  is  equivalent  to  the  statement  that  the  control 
u,*,  t  •>  0,  1,  •  -  •  ,  m  —  1,  and  trajectory  x,*,  i  -  0,  1,  •  •  •  ,  m,  give  an 


NONLINEAR  OPTIMAL  CONTROL  PROBLEMS 


231 


optimal  solution  to  the  problem  obtained  by  linearising  (2.7)  about  uf 
and  x*. 

However,  without  some  further  assumption,  the  relationship  (8.11)  may 
not  hold.  This  is  shown  by  the  following  simple  two-dimensional  example 
Let 

(3.12)  Z  -UiOSs,*  1,0  Sit,  ^1} 
and 

(3.13)  v (*)  -4 (*»-*)•-%, 
so  that  the  feasible  set  S  is  given  by 

13.14)  S  -  |*|  4(*i  -  i)1  -  *  2  0, 0  3  *,  2  1, 0  £  s>  2  lj. 

This  is  illustrated  in  Fig.  3.  Also  let  #(*)  -  tj .  We  have 

(3.15)  w(*.  y)  -  f(y)  +  8(yi  -  \){tx  -  y,)  -  (fi  -  y*), 
so  that  for  y*  -  ( 1, 0)  we  get 

(3.16)  r„' «  {*14*!  —  *,  —  3*0,0  S*,S  1,0  i  »,  S  1). 
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The  solution  to  (3.10)  for  j  -  0  is  easily  seen  (from  Fig.  3)  to  be  yl  -  f. 
The  sequence  \y*\  obtained  in  this  way  converges  to  y*  -  (i,  0),  with 
4>(y*)  -  §.  But  Ty*  is  the  interval  [0,  I)  on  the  tx  axis,  so  that 
min,€r»*  4(*)  "  0,  and  is  attained  at «  »  (0, 0)  ^  y*. 

In  order  that  the  limit  point  y*  always  satisfy  (3.11)  it  is  sufficient  that 
the  mapping  Ty  be  continuous.  The  mapping  Ty  is  continuous  (both  upper 
and  lower  semicontinuous)  if  for  any  point  y1  £  S  and  any  point  y*  €  S 
in  the  neighborhood  of  yl,  there  is  some  point  of  Ty1  close  to  each  point 
of  Ty*.  The  continuity  of  Ty  follows  from  two  assumptions  we  make  con¬ 
cerning  the  set  S. 

(1)  For  each  y  €  S,  the  Jacobian  matrix  v,(y)  has  full  row  rank,  that  is, 
rank  «1S*. 

(2)  For  each  y  €  S,  the  convex  set  Ty  contains  interior  points. 

These  two  assumptions  are  essentially  the  Kuhn-Tucker  constraint  qualifi¬ 
cation  for  the  set  <S  (see,  for  example,  [2]).  The  proof  that  (1)  and  (2) 
imply  the  continuity  of  Ty  is  given  in  the  Appendix.  A  slightly  stronger 
assumption  than  (2),  which  however  involves  only  the  rank  of  an  aug¬ 
mented  Jacobian  matrix,  is  also  given  there. 

The  difficulty  in  the  previous  two-dimensional  example  occurs  because 
the  assumption  (2)  above  is  not  satisfied.  In  particular,  for  y*  =*  (},  0), 
Ty*  is  just  the  interval  [0, 1],  As  a  result  the  mapping  Ty  is  not  continuous 
in  the  neighborhood  of  y*. 

The  first  assumption  above  is  always  satisfied  when  the  function  v(z) 
is  defined  by  (2.11),  as  shown  in  the  following. 

Lemma.  If  v(z)  corresponds  to  the  discrete  recursion  relation,  as  given  by 
(2.11),  then  assumption  (1)  is  satisfied. 

Proof.  Directly  from  (2.11)  we  have  that 


dVj.j 

dZi+l,} 


(3.17) 


dvuj 

dx%+i,p 


=  0, 


V  *  j, 


~~  -  0,  q  >  i  +  1,  p  *■  1,  •  •  ■ ,  n, 

OXqtp 

for  t  =  0,  1,  •  •  ■  ,  m  —  1  ;j  =  !,-••,«.  Therefore  the  Jacobian  matrix  v, 
contains  a  square  (mn  X  mn )  lower  triangular  matrix  with  elements  —  1 
along  its  diagonal.  Since  such  a  matrix  is  nonsingular  and  since  v,  has  mn 
rows,  v,  has  full  row  rank. 

4.  Convergence  of  iterative  procedure.  The  iterative  procedure  wilt  now 
be  considered  in  more  detail.  We  again  consider  the  convex  function  <l>  from 
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Z  to  E\  with  <t>  €  Cl(Z).  Since  S  is  compact,  +(z)  is  bounded  and  attains 
its  minimum  for  z  €  S.  In  particular,  let 

(4.1)  m  ■  min^(*). 

16* 

For  each  y  €  S,  the  set  Ty  is  compact  so  that  the  minimum  of  <t>{z)  for 
z  €  Ty  is  attained.  We  let 

(4.2)  ¥(v)  “  min$(z). 

»€T» 

We  now  show  that  because  of  the  continuity  of  Fy,  the  function  ♦(y)  is 
continuous  for  y  €  S. 

Lemma.  ¥(y)  u  continuous  for  y  6  S. 

Proof.  For  yl  €  S,  let  ¥(y‘)  be  attained  at  tl  €  Ty1,  that  is 
^(y1)  =*  ^(z1).  Now  choose  y*  6  S  close  to  y1,  and  let  ¥(y*)  be  attained  at 
z*.  so  that  ¥(y*)  *  $(z*).  Suppose  ^(s*)  ^  +(zl).  Now  by  the  continuity  of 
Fy  we  can  choose  I1  €  Ty1  close  to  z*.  Then  by  the  continuity  of  $(z)  we 
have  ^(2*)  close  to  $(z*).  But  since  ^(z1)  ^  $(z)  for  every  z  €  Ty*,  we  have 

(4.3)  *(**)  £  *(z>)  S  ‘♦(f1), 

so  that  $(z‘)  is  close  to  $(z*). 

A  similar  argume.it  holds  for  ^(z1)  £  ^(z*). 

Starting  with  y°  €  <S  we  generate  a  sequence  of  vectors  (y^  as  follows: 

(4.4)  *(y'+l)  -  min  *(z),  j  -  0, 1,  •  •  •  . 

Note  that  if  Z  is  a  polyhedral  set  then  Ty'  is  a  polyhedral  set  determined 
by  specified  linear  inequalities.  Furthermore,  $(z)  is  a  convex  function,  so 
that  for  each  yi  we  solve  a  straightforward  convex  programming  problem 
with  linear  constraints. 

Theorem  2.  Every  vector  of  the  sequence  [y*]  is  in  S.  The  corresponding 
sequence  of  values  ($(yy)}  is  monolonically  decreasing.  The  sequence  {yj 
contains  a  convergent  subsequence  converging  to  a  point  y*  €  S  such  that 

(4.5)  M  £  4>{y*)  &  *(yy),  j  -  0, 1,  ..  , 
and 

(4.6)  *(y*)  -  min  *(z). 

*€Tif* 

Proof.  By  Theorem  1,  we  have  yy  €  Ty*  C  S,  so  that  each  y*  is  in  S. 
Also  since  y 1  C-  Ty *  we  must  have 

(4.7)  *(?/")  =  min  6(z)  S  *(yy), 

*€  r*> 

s,o  that  !<Myy)]  is  moiiotonically  decreasing. 
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Since  S  is  bounded  the  sequence  {j/'j  contains  a  convergent  subsequence. 
Let  y *  be  the  limit  point  of  such  a  convergent  subsequence.  Since  S  is  com¬ 
pact,  y *  6  S,  and  #(y*)  i  Furthermore,  from  the  monotonicity  of  the 
sequence  {<t>(y3)\  the  relation  (4.5)  must  hold. 

To  demonstrate  (4.6),  we  observe  that  since  y*  £  S,  we  have  y*  6 
so  that 

(4.8)  V(y*)  =  min  <f>(z)  £  4>(>f). 

»6lV 

Now  suppose  that  4'()/*)  <  <t>(y*).  Then  by  the  continuity  of  'Hy)  we  can 
pick  k  sufficiently  large  so  that  ^(if)  <  <i>(y*).  But  from  (4.2)  and  (4.4) 
we  have  <t>(yk+l)  =  'k(yk),  so  that  <f>(ykk')  <<£(&’*),  which  contradicts 
(4.5).  Therefore  we  must  have  Viy*)  —  4>(y*). 

Theorem  3.  Let  y*  be  a  limit  point  of  j  y!\ .  Then  if  is  the  global  minimum 
of  the  partially  linearized  problem  about  the  point  y*.  Furthermore ,  the  opti¬ 
mality  conditions  ( the  Kuhn-Tucker  necessary  conditions)  which  must  be 
satisfied  at  a  global  minimum  of  the  problem  (2.12)  are ,  in  fact,  satisfied  at  y*. 

Proof.  The  set  Ty*  is  the  intersection  of  Z  and  the  convex  set  W(y*) 
obtained  by  linearizing  the  constraints  c(z)  ^  0,  about  z  =  if.  It  follows 
immediately  from  (4  6)  that  y*  is  a  global  optimum  solution  to  this  partially 
linearized  problem. 

As  mentioned  in  the  previous  section,  the  assumptions  (1)  and  (2)  on 
the  set  S  are  equivalent  to  the  Kuhn-Tucker  constraint  qualification.  It  is 
shown  in  their  original  paper  [11]  that  with  this  qualification  the  optimum 
solution  z*  to  a  general  nonlinear  problem  has  the  property  that  the  gradient 
V<£(z*)  must  belong  to  the  convex  cone  of  inward  normals  to  the  active 
constraints  at  z*.  The  solution  y*  to  the  partially  linearized  problem  about 
y*  will,  of  course,  also  have  this  property.  Therefore,  ~4>(y*)  belongs  to 
the  convex  cone  of  inward  normals  to  the  active  constraints  at  y  ,  i.e.,  the 
Kuhn-Tucker  necessary  conditions  for  a  global  minimum  are  satisfied 
at  y*. 

5.  Computational  solution.  The  computational  solution  of  the  nonlinear 
discrete  optimal  control  problem  (2.1)-(2.4)  is  considered  in  this  section. 
We  will  assume  that  the  convex  compact  sets  L\  and  X,  are  convex  poly¬ 
topes  defined  by  specified  linear  inequaiities  (see  Appendix).  In  order  to 
apply  the  computational  method  we  need  only  make  the  additional  assump¬ 
tion  that  the  functions  v(x,  u)  and /(*,  u)  are  of  class  Cl  on  each  Xi  X  Ui . 
However,  in  order  to  insure  rhe  validity  of  the  convergence  proof  (Theorem 
2)  we  must  make  an  additional  assumption  concerning/ and  an  assumption 
about* the  linear  inequalities  defining  the  A,  and  (/, .  We  assume  that  each 
component/,  of/,  /  =  1,  •  •  •  ,  n,  is  either  convex  or  concave  on  X,  X  f/, . 
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For  i  =  0, 1,  •  •  •  ,  m  -  1  and ,?  *  1,  •••  ,nwe  let 
Ui.j  ~  »  Wi)  "4*  > 

■  _  fvi.j  for/,  convex  on  X,  X  {/< , 

t/’"’  _  \  —  i>ti,  for/,  concave  on  X,  X  t/, . 

The  function  t(^),  with  components  Vij  f  is  thus  a  convex  function  on  Z. 
Furthermore,  the  equations  (2.2)  are  now  equivalent  to  v(z )  =  0. 

As  discussed  in  the  Appendix  the  linear  inequalities  which  define  the 
X,  and  l\  are  specified  in  terms  of  the  vector  z  by  0/2  —  fe,  ^  0, 
i  —  1,  •  •  •  ,  k,  giving  the  polyhedral  set  Z.  We  make  the  following  assump¬ 
tion  about  these  linear  inequalities.  Let  y  £  S  be  a  boundary  point  of  Z, 
i.e.,  v(y)  =  0,  and  a/y  —  5,  =  0,  i  -  1,  •  •  •  ,  £  Then  the  (!f^)  X  8 
matrix  consisting  of  v.(y)  augmented  by  the  rows  a / ,  i  =  1,  •  •  •  ,  £,  is  of 
full  row-  rank  ( =i  +  £).  According  to  the  I  omnia  at  the  end  of  §3,  v„(y) 
is  always  of  full  row’  rank,  so  this  assumption  is  essentially  a  condition  on 
the  vectors  a, .  As  shown  in  the  Appendix  it  follows  from  the  full  rank 
condition  that  Ty  is  a  continuous  mapping.  The  convergence  proof  of 
Theorem  2  is  applicable  because  0(2 )  is  convex  and  Ty  is  continuous. 

At  each  iteration  we  wish  to  solve  a  mathematical  programming  problem 
of  the  form, 

(5.2)  min  {<^(2)  |  a/2  -  6,  £  0,  i  *  1,  •  •  •  ,  k;  w{z,  y)  £  0}. 

f 

This  is  a  linear  constraint  problem  with  m(r  n)  -f-  n  variables  and 
k  -f- 1  coastraints.  For  small  problems  a  direct  computational  solution  of 
(5.2)  causes  no  difficulty.  In  many  practical  cases  however,  the  number  of 
state  variables  is  greater  than  the  number  of  control  variables,  i.e..  r  <  n. 
In  such  a  case  there  is  a  considerable  computational  advantage  in  treating 
the  linearized  problem  (5.2)  as  the  linear  problem  was  treated  in  [14].  In 
effect,  the  linear  relations  w(z,.  y)  ~  0  are  used  to  solve  explicitly  for  the 
vectors  x< ,  i  *  1,  •  •  •  ,  m,  in  terms  of  xo  and  the  u< ,  i  *=  0, 1,  •  •  •  ,  m  —  1 . 
Substitution  for  the  vectors  x<  in  4>(z)  and  the  inequalities  a/2  —  gi  0 
reduces  the  original  problem  (5.2)  to  on©  in  only  mr  +  «  variables.  This 
reduced  problem  may  then  be  solved  by  an  appropriate  linear  constraint 
method  which  takes  advantage  of  the  particular  form  of  4>.  For  example,  if 
<t>  is  quadratic,  a  quadratic  programming  method  may  be  used. 

In  the  important  case  where  4>  is  linear,  a  further  efficiency  is  made 
possible  by  treating  the  reduced  problem  as  the  dual  problem,  and  solving 
the  corvesnonding  primal  linear  programming  problem.  This  permits  us  to 
take  advantage  of  the  fact  that  the  variables  of  the  reduced  problem  (the 
control  variables)  are  not  required  to  be  nonnegative,  and  that  there  are 
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more  inequality  constraints  than  variables.  The  corresponding  primal 
problem  consists  of  mr  +  n  equations  in  mn  +  k  nonnegative  variables. 
The  numerical  examples  discussed  below  are  of  this  type. 

The  use  of  the  linear  equality  relations  w(z,  y)  =  0  has  the  additional 
computational  advantage  that  no  modification  of  the  true  objective  function 
is  required.  On  the  other  hand  a  possible  theoretical  difficulty  may  arise 
since  even  with  v(z)  convex  it  is  usually  not  true  that  y’  €  IV  when  Ty  is 
determined  by  w(z,  y)  =  0.  Thus  the  monotone  behavior  of  is  not 
guaranteed.  However,  no  such  difficulty  has  been  observed  in  the  actual 
numerical  calculations. 

In  order  to  illustrate  the  application  of  the  iterative  method  we  will 
discuss  two  numerical  solutions  to  a  nonlinear  problem.  The  problem  con¬ 
sidered  is  a  discrete  approximation  to  the  following  continuous  scalar 
(n  =  1)  problem: 

min  j  u(i)  dt, 


subject  to  x  =  f(x,  u),  |u(0|  ^  1,  for  t  €  [0, 1],  and  x(0)  =  1,  x(l)  =  J, 
where  fix,  u)  -  —  § i  +  i!  +  «(/).  An  additional  state  constraint  is 
imposed  in  the  second  example.  The  initial  trajectory  used  to  start  the 
iteration  was  x°(t)  =  1,  for  t  t  [0, 1]. 

For  these  examples  the  simplest  (forward)  finite  difference  scheme  was 
used,  namely, 

(5.3)  xi+ 1  -  Xi  =  Atf\Xi ,  Ui),  i  =  0, 1,  •  ,  m  -  1, 


so  that 


(5.4) 


=  A//(x, ,  Ui)  +  Xi  -  x»+i  =  (1  -  §A<)x.-  +  Af(x,)2 

+  A tUi  —  T,+1 ,  i  =  0, 1,  •  •  •  ,  m  —  1. 


For  Xi1  known,  the  linearized  system  which  must  be  satisfied  by  x/+l  and 
Wi  =  -x£i  +  [14-  Al(2x/  -  f)]x/+1  +  A tUim  -  A /(»>)* 


M{J+I  IS 


(5.5) 


==  0,  i  =  0, 1,  •  •  •  ,  m  —  1. 


This  system  is  solved  using  the  specified  initial  value  for  x(0)  to  give  the 
Xii+1  explicitly  as  linear  functions  of  the  u/+!, 

(5.6)  x,y+I  =  d>+,(«£  1,  •  •  •  ,  W+'),  i  =  1,  •  •  •  ,  m. 


The  following  linear  programming  problem  (in  the  dual  form)  is  then 
solved  at  each  iteration  to  give  the  new  optimal  control  u,y+I, 
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i  f  0, 1,  •  •  •  ,  m  —  1: 


mm 


(5.7) 


[Z  «.•  I  -I  £  *  £  M 

U-o 


0, 1,  •  •  •  ,  m  -  1; 


i  g  4,w(«.-i ,  u*-* ,  •  •  • , «,)  s  i j 


The  corresponding  state  trajectory  x,J  ,  i  *  1,  •  •  •  ,  m,  is  then  given  by 
(5.6). 

The  iteration  was  started  with  x,a  *  1,  i  *  0, 1,  •  •  ■  ,  m,  and  a  value 
of  m  ~  20  (At  =  0.05)  was  used.  The  results  for  the  first  numerical 
example  are  shown  in  Figs.  4  and  5.  Convergence  was  achieved  (within  the 
desired  accuracy)  in  three  iterations.  However,  the  difference  between  x* 
and  x*  =  x  is  too  small  to  be  shown  graphically  (Fig.  4).  Note  the  rapid 
convergence  even  though  the  initial  guess,  x»#,  for  the  trajectory  was  very 
poor  and  did  not  even  satisfy  the  terminal  boundary  condition.  The 
corresponding  optimal  control  u*  is  shown  in  Fig.  5.  The  monotone  be¬ 
havior  of  the  function  value  is  verified  by  the  successive  values  of 
<t>’  -  XT~o'  «/.  Those  were  -  -0.286,  4?  «  -0.946,  and  4>*  -  -0.950. 


1 


Fin.  4.  Initial  and  optimal  elate  trajectoriee  for  nonlinear  numerical  example 


238 


J.  B.  ROSEN 


+1 


TIME 


1.0 


-1 


Fio.  5.  Optimal  control  for  nonlinear  numerical  example 


For  the  second  example  the  state  constraint,  x(|)  g  -$,  was  imposed. 
This  of  course  eliminates  the  solution  shown  in  Fig.  4.  The  sequence  of  5 
state  trajectories  obtained  is  shown  in  Fig.  6.  The  corresponding  function 
values  were  *  2.792,  **  —0.144,  =  —0.656,  4>*  *  —0.972,  and 

-  -1.008.  The  control  from  the  first  iteration  u,1  and  the  optimal 
control  u*  are  shown  in  Fig.  7.  All  of  the  state  trajectories  (except  for  the 
initial  guess)  are  seen  to  satisfy  the  state  constraints.  It  is  interesting  to 
observe  that  the  method  not  only  converges  to  a  different  trajectory  x*l 
but  that  the  added  state  constraint  is  not  active  for  this  limit  trajectory. 
Thus  the  state  constraint  forces  the  solution  away  from  its  previous  se¬ 
quence  and  allows  it  to  converge  to  a  different  local  minimum.  On  the  other 
hand,  in  some  other  nonlinear  state-constrained  cases  which  have  been 
computed  by  this  method,  a  state  inequality  constraint  of  the  type  imposed 
here  has  remained  active  for  the  limiting  trajectory.  Finally,  it  slmuid  be 
noted  that  for  both  cases  the  limiting  control  has  the  properties  of  an  opti¬ 
mal  control  for  a  discrete  linear  problem,  that  »,  n(  - 1)  switchings  and 
m  —  n(  *  19)  values  of  u  *  —  ±  1. 

Appendix.  In  this  Appendix  we  prove  that  the  assumptions  (>)  and  (2) 
of  §3  imply  the  continuity  of  Ty.  We  also  show  the  validity  of  the  modified 
objective  function  (2.13). 


NONLINEAR  OPTIMAL  CONTROL  PROBLEMS 


239 


Fio.  6.  Slat  trajectories  for  nonlinear  example  jnth  added  state  constraint, 


We  first  f  tat*  a  condition  on  the  rank  of  an  augmented  Jacobian  matrix  * 
which  insures  the  satisfaction  of  the  assumption  (2)  of  $3.  In  order  to 
state  this  condition  v  e  must  have  an  explicit  statement  of  the  constraints 
which  define  the  compact  set  Z. 

We  will  assume  that  Z  is  the  polyhedral  set  determined  by  the  system  of 
k  linear  inequalities 

(A.l )  ait  -  bi  £  0,  t  -  1,  •  •  •  , k, 

or 

(A.2)  Z  -  |*  |  A*  -  b  2  0), 

where  A  is  an  *  X  k  matrix  with  specified  columns  o< ,  and  6  €  E*  is 
specified.  Let  2  denote  a  boundary  point  of  Z.  Then  we  must  have  at  least 
one  active  constraint  at  I,  that  is,  a /I  -  -  0  for  at  least  one  value  of  ». 

We  will  denote  by  A(s)  the  matrix  whose  oolumns  represent  the  active 
constraints  at  i.  Similarly,  let  V (u)  represent  the  Jacobian  matrix  of  the 
vector  0(*)  which  contains  all  components  of  v(i)  for  which  *»<(*)  -  n 
That  is,  9(i)  ■»  0,  and  V'ft)  —  0,(*). 
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Fig.  7.  Controls  for  nonlinear  example  with  added  stale  constraint 


We  will  denote  the  boundary  point?  of  S  by  dS.  It  follows  that  for  eveiy 
y  6  dS,  the  matrix 

(A. 3)  B{y)  -  [V(y)  A(y)] 

is  defined  and  has  at  least  one  column.  We  will  say  that  B(y)  satisfies  the 
full  rank  condition  at  y  f  dS  if  the  columns  of  B(y)  are  linearly  indepen¬ 
dent. 

Assumption  (1)  implies  that  B(y)  satisfies  the  full  rank  condition  at 
every  y  €  dS  which  is  also  interior  to  Z.  This  is  true  because  for  such  a 
point  A(y)  -  0,  and  T(y)  certainly  has  full  column  rank  since  it  consists 
of  selected  columns  of  vf  .  Furthermore,  assumption  (2)  is  implied  by  the 
full  rank  condition  on  B,  as  shown  by  the  following. 

Lemma.  Let  B(y)  satisfy  the  full  rank  condition  for  every  y  €  dS.  Then 
for  each  y  €  S,  the  convex  vet  Ty  contains  interior  points. 

Proof.  First  suppose  g  €  5  is  an  interior  point  of  S.  Since  S  C  Zt  g  is  an 
interior  point  of  Z.  Furthermore,  wig,  ff)  »  v(g)  >  0,  so  that  g  is  an 
interior  point  of  W(g).  Therefore  g  is  an  interior  point  of  Tg. 

Now  suppose  g  €  9S.  The  set  Tg  is  the  polyhedral  set  determined  by  the 
k  +  l  linear  inequalities 

(A.4)  r#  -  \z  I  w(z,  y)  £  0,  A'z  -  6  £  0}. 
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Now  consider  the  point  2  *  tf.  We  may  assume  without  loss  of  generality 
that 


(A.5) 

and 

(A.6) 


wdtf,  tf ) 


l  +  !,  *  •  *  » 


_  V,  ,  /-0,  «  -  1,  ••• 


for  some  t  >  0.  Then  the  columns  of  P(tf)  are  the  gradient  vectors  Vi /,(tf), 
i  -  1,  •  •  •  ,  J,  and  the  columns  of  A(jj)  are  the  vectors  o< ,  i  «  1,  •  •  •  ,  i. 
Since  B(y)  satisfies  the  full  rank  condition,. its  columns  are  linearly  in¬ 
dependent  and  there  exists  no  vector  r  £  2?*+l,  except  r  *  0,  such  that 
B(y)r  =  0.  Then  by  a  variation  on  the  Farkas  lemma  (see  [8,  Theorem  2.9, 
p.  48]),  there  exists  a  vector  I  €  F*  such  that 

(A.7)  i'B(p)  >  0. 

Now  consider  the  point 

(A.8)  tf  -  tf  +  il, 

where  i  >  0  is  chosen  sufficiently  small  so  that 

vfVviil))  <t,  i  -  l  +  1,  •  •  •  ,  I, 

(A.9) 

ila,  <  t,  i  -  *  +  1,  •  •  •  ,  *. 

Now  consider  u\(tf,  tf),  i  *  1,  •  •  •  ,  /,  and  alp  -  6, ,  t  -  1,  •  •  •  ,  k.  From 
(A.5),  (A.6),  (A.7)  and  (A.8)  we  have  «\(tf,  tf)  >  0,  *  -  1,  •  •  •  ,  l,  and 
a, tf  —  bi  >  0,  *  -  1,  •••  ,  ft.  From  (A.5),  (A.6),  (A.8)  and  (A.9)  we  have 
u\(tf,  V)  >  0,  *  -  l  +  1,  •  •  •  ,  /,  and  alp  -  6,  >  0,  *  «  £  +  1,  •  •  •  ,  *. 
Therefore,  tf  is  interior  to  every  constraint  of  Tp  and  is  an  interior  point  of 

rtf- 

Theorem  4.  The  mapping  Ty  is  continuous  for  y  £  S. 

Proof.  Because  v(t)  €  C*  on  the  compact  set  Z  a  uniform  bound  y  exists 
such  that  for  any  (2,  y\  y*)  €  «S  X  5  X  S, 

(A.10)  ||w(2,  y‘)  -  10(2,  y*)||  S  yliy1  -  y*ll- 

Also  since  v,(y)  is  of  rank  1  for  y  €  S,  the  symmetric  matrix  vjol  is  positive 
definite  at  every  point  of  S .  Therefore  a  uniform  bound  0  exists  such  that 

(A.11)  ii(Mi/r,ii  *  t 

for  every  y  €  S. 

Suppose  we  are  given  yl  €  S  and  €  Ty1.  Then  given  any  <  >  0,  we 
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now  show  that  we  can  choose  6  >  0  so  that,  for  each  y*  f  S  with 
(|y*  ”  y*l|  ^  4,  we  can  find  r*  €  ry*  such  that  H21  —  2* ||  ^  e. 

If  2*  €  ry*,  the  theorem  is  true  with  2 *  =  21.  Now  suppose  21  $  Fy*,  that 
is,  at  least  one  component  of  w(z\  y *)  is  negative.  Without  loss  of  generality 
we  assume  that  v>i(z\  y *)  <  0,  for  t  =  1,  -  ,  i  ^  /,  and  10,(2*,  y*)  ^  0, 
for  *  ■»  Jfc  +  1,  •  •  •  ,  f.  Since  21  6  Ty1,  we  have  Wi(zl,  y1)  ^  0,  for 
i  ■  1,  •  •  •  ,  /.  Let  ©  6  E1  be  the  vector  with  ©,  =  10,(2*.  y*)  <  0, 
t  —  1,  •  •  •  ,  Jfc,  and  ffi,  «  0,  *  «  k  -f  1,  •  •  ,  Z.  Then 

(A.12)  I©, |  £  |i 0,(2*,  y1)  -  10,(2*,  y*)|,  t  =  1,  •  •  •  ,  l, 

so  that 

(A.13)  ||*||  *£  ||u>(2*,  y‘)  -  10(2*,  y*)||  g  7|!y*  -  y*||, 

where  the  last  inequality  follows  from  (A.10). 

We  first  assume  2*  is  an  interior  point  of  Z.  Then  there  is  an  <j  with 
0  <  «i  £  «,  such  that  z  €  Z  for  ||2  —  2*||  Sj  tj .  Choose  i  =  ti/Py,  and  let 
y*  be  any  point  in  <S  with  ||y*  —  y’|J  ^  5.  Now  choose  ©  as  above,  and  let 

(A.14)  2*  -  2*  -  t-/(y’)MyW)r*©. 

From  (3.2)  we  have 

w(z\  y*)  »  v(y*)  +  v,(y)W  -  y*|  -  t’.(y*)«’/(y')My*)i'/(y,)r1© 

(A.15) 

=  10(2  ,  y  )  —  ©  «£  0, 

so  that  2*  €  Wr(y\).  Furthermore  from  (A.14)  and  (A.ll)  we  have 

(A.16)  Ik1  -  2*|l*  -  ©'[t-.(y*)t-/(y,)r‘©  £  d*:©;i*. 

Since  |jy*  -  y*||  £  *i0y,  we  get  from  (A.16)  and  (A.13)  that 

(A.17)  |>*  -  **•!  £  d!'©!l  S  frrlly*  -  y*!l  £  *.  - 

But  this  shows  that  2*  *■  Z,  and  therefore  2*  (  ry*.  Finally  since  t,  £  «,  we 
have  j|2*  —  2*j|  <t  t,  as  was  to  be  shown. 

The  other  possibility  we  must  consider  is  that  2'  (  Fy*  is  a  boundary 
point  of  Z.  Since  Fy1  has  interior  points  and  is  a  convex  act  there  are  interior 
points  in  the  neighborhood  of  every  point  of  Ty‘  (see,  for  example,  (7J). 
In  particular  there  exist  ,  0  <  n  2»  «/2,  and  z*  (  Ty*,  such  that 
I!*1  —  r'l!  £  «/  2  and  'Jr  -  2*1!  £  tj  implies  that  2  is  interior  to  Z.  Now 
ehooee  4  ■  t*/0y,  and  replace  r*  by  r*  in  the  previous  argument.  This  gives 
a  point  2’ t  Ty*  with  jjr*  —  2,:,!  £  «/2.  It  follows  that  2*  —  2*j|  ^  «. 

We  now  prove  the  statement  about  the  modified  objective  function 
(2.13)  made  at  the  end  of  §2-  We  define  the  t  X  (/  -f  f)  augmented 
Jacobian  matrix  Hiy)  -  (r/ty)  A(y)).  Let  $(2)  be  as  in  (2.13). 
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Theorem  5.  Let  B(y)  have  full  column  rank  for  every  y  £  8.  Then  a  value 
of  a  exists  such  that  every  local  solution  of 

(A. 17)  mir.  1 4>{z)  J  z  £  Z,  v(z)  £  0} 

• 

is  also  a  local  solution  of 

(A. 18)  rain  |$(z)  \  z  f.  Z,  v(z)  *  0}. 

e 

Proof.  Since  4  f  C1  and  B(y)  has  lull  column  rank  on  the  compact  set 
S,  there  are  constants  m  and  «i  such  that  for  any  y  £  S, 

(A.  19)  WlfH.'  £  an, 

and 

(A. 20)  B{y)r:  z  njlrj;,  rf£,+‘. 

We  choose  a  >  a,  n  .  I»t  y*  he  a  local  minimum  of  (A.  17).  Because  of 
the  rank  condition  on  Biy),  the  necessary  Kuhn-Tucker  conditions  are 
satisfied  at  y*.  The  relevant  conditions  are  that  there  exist  vectors  p  0 
and  o  £  0  such  that 

i 

(A. 21)  v.'(y*)p  +  Aiy*)q  =  Ytfty*)  =  V+(y*)  4-  a £  Vi\(y*) 

and 

(A.22)  r,(y*)pi  -  0,  i  *  1,  •  ••  ,  /. 

We  let  r  =  ip,  ~  a,  ,  p  -  a,  </,,■■■,  gi),  and  write  { A.21 )  as 

{ A.23)  B[y*)r  »  V*!y*). 

From  (A  191  ud  iA.20t  it  follows  that 

t A.24 )  ♦,  r  S  -  '!^(y*)Ji  $  a, , 

or  ||r|i  £  a,  ♦,  Hut  this  requires  ja  —  p,\  £  ai/«>  <  a,  t  »  1 

or  p,  >  0,  i  «  1,  ,  Thru  from  i A.22)  we  must  have  r,(y*)  -  0, 

i*l,-,/,  so  that  y*  w  a  feasible  solution  of  (A. 18). 

Now  suppose  y*  is  not  a  local  minimum  of  IA.18).  Then  for  some  point 
y5  £  Z,  arbitrarily  close  to  y*.  we  have  r(y')  ■  0  and  #(y\)  <#(y*) 
But  then  <My')  <  #(y*),  so  that  y*  is  not  a  local  minimum  of  (A. 17). 
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Numerical  Solution  of  Optimal  Control  Problems 


J.  B.  Rosen 
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University  of  Wisconsin 

Summary 

I.  The  discrete  optimal  control  problem  to  be  considered  is  as 
follows: 

Let  x  c  En  denote  the  state  vector  at  time  t  ,  and  u,  €  Er 
1  1  1 

the  corresponding  control  vector.  The  system  dynamics  are  given  by 

+  (  xj  *•  i(x  ,  u  )  .  1  =  (1.1 

whore  the  controls  u  must  be  selected  so  that 

i 

u  e  U  c  Er  .  i  =  0,  t ,  . .  .m-l 

1  *  n 

x  €  X  E  .  i  =■  0,  l ....  m 

l  i 

It  is  assumed  that  the  sets  and  X  ore  compact  and  convex;  and 

that  f  is  continuous  on  x  X^.  We  call  the  sequence  (x^)  the 

tate  trajectory,  the  sequence  (u^ j  the  control,  and  we  denote  by 

r  fx  ,  Uj)  the  direct  product  of  these  two  sequences.  We  say  that 

z  is  admissible  if  and  fu^)  satisfy  (1.1)  and  (1,2).  Note 

that  we  can  ?  pecify  the  initial  and  terminal  values  x  end  x  by 

o  m 

setting  X  =  x  and  X  =  x  . 

o  o  m  "* 


z 


f 


We  assume  a  to  be  continuous  on  x  and  define 
m-l 

9(z)=  T,  o<x.,u) 
i=0  1  1 


(1.3) 


We.  wish  to  find  an  admissible  z*  such  that  <p(z)  attains  its  minimum, 
over  all  admissible  z  ,  at  z  =  z*  . 

Now  let  us  consider  the  following  continuous  optimal  control 
problem.  Let  x(t)  and  u(t)  satisfy 


•  _  dX  7, 

*  ’  57 , " f(x 

u(t)  6  U(t) 
X(t)  €  X(t) 


t  €  [  0,  Tj 


(1.4) 


Find  x*(t)  satisfying  (1.4)  such  that 


?{x(t),  u(t))dt 


(1.5) 


attains  its  minimum  over  all  x(t)  and  u(t)  which  satisfy  (1.4)  .  Suppose 
that  we  choose  a  finite  difference  step  At  =  T/m  ,  and  use  the  simplest 
approximation  x  (iA t>  =  (x.  +  J  -  x^/At.  We  also  evaluate  the  integral 
(> .  3)  by  the  trapezoidal  rule  and  let  f  =  AtT  and  p  =  At-jy  .  We  then 
formally  obtain  the  equivalent  discrete  problem  (l.i),  (1.2)  and  (1.3). 

The  terminal  time  T  is  assumed  to  be  specified  in  the  continuous 
problem  as  given  by  (1.4)  and  (1.5).  We  can  however  put  a  problem  with 
variable  terminal  time  into  this  fixed  time  formulation  by  introducing  an 
additional  state  variable.  To  illustrate  this,  suppose  the  variable  time 


problem  is  given  by 


3 


dy  .  .  _n-l 

=  g(y.  u),  y,  g  €  E 

T 

7}  (y,  u)  dr  =  min,  T  variable 

We  introduce  a  new  state  variable  £  >  0,  and  let  t  =  §t,  t  €  [0,  l] , 
We  require  that  £  satisfy  f  =  0,  and  that  its  initial  value  £(0)  be 
determined.  If  we  define  the  vectors  in  En  , 


and  let  u(x,  u)  =  £t)( y,  u),  the  resulting  problem  given  by  (1.4)  and 
(!  ,5)  with  T  -  l  is  equivalent  to  the  variable  time  problem. 

It  should  also  be  remarked  that  explicit  dependence  on  t  of  ^ 
and  f  can  be  handled  with  no  essential  difficulty.  Such  dependence 
leads  to  functions  ^  and  f^  in  (1,1)  and  (1.3)  which  depend  explicitly 
on  the  index  i  .  To  simplify  the  presentation,  we  will  not  consider  such 
dependence , 


II.  We  will  now  sh^w  that  the  discrete  optimal  control  problem  can  be 
considered  as  a  mathematical  programming  problem  (in  general,  nonlinear) 
with  a  special  structure  ( l] .  We  let  s  =  mr  +  (m+l)n,  and  consider  the 

r-  Q  S 

vector  z  in  the  product  space  E  -  We  denote  by  Z  c  E  ,  the  compact, 
convex  subset 


Z 


uA  e  i  =  0.  I,  . .  ,m-l 

Xj  €  1  =  0,  1,  . .  .m 


(2.1) 


We  also  define  a  vector  mapping  v:  E8-*  Emn,  so  that  the  recursion 


4 


relations  (t .  l)  are  all  given  by  v(z)  =  0.  That  is  we  define 
v  t  =  ffX  ’Ujl  +  x  -x  .  1  =  0,  l 


(2.2) 


and  let  represent  the  components  (i-l )n+ 1,  . . .  .in,  of  v.  The 
discrete  optimal  control  problem  can  now  be  stated  as  that  of  finding  a 
z*  which  solves  the  mathematical  programming  problem 


min 

z 


( 


<PU) 


Z  €  Z 
v(z) 


0  } 


(2.3) 


where  cp(z)  is  given  by  (1.3)  . 


The  admissible  (feasible)  set  S  c  E  is  given  by 


s  .  f  ,  I  *  '  2 

l  I  v (z)  =  0 

The  set  S  may  be  empty,  in  which  case  no  control  and  corresponding 
trajectory  exist  which  satisfy  (1 . 1)  and  (1.2).  In  many  practical  situations 
the  existence  of  an  admissible  control  and  trajectory  with  the  given  dynamics 
and  imposed  constraints  is  the  primary  question.  If  no  admissible  solution 
exists  it  is  necessary  to  relax  the  control  constraints  (by  Increasing  the 
allowable  range  on  some  of  the  controls,  for  example)  or  relax  the  state 
constraints  (by  increasing  the  size  of  the  target  set  Xm,  for  example) 
before  the  determination  of  an  optimum  solution  can  be  considered.  In 
some  cases  an  admissible  solution  may  also  be  achieved  by  an  appropriate 
modification  of  the  system  dynamics .  In  any  event,  the  determination  of 
whether  or  not  an  admissible  solution  exists  has  been  reduced  to  finding 
any  feasible  solution  to  the  problem  (2,3)  , 

If  S  is  not  empty,  it  is  a  compact  set,  so  that  qp  attains  its 
minimum  on  S  .  If  the  null  space  of  v  is  convex,  then  S  is  also  convex. 


} 


(2.4) 
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If  cp  is  convex  on  Z  (o  convex  on  x  U^),  then  (2.3)  is  a  convex 
programming  problem  for  which  both  necessary  and  sufficient  optimality 
conditions  can  be  stated,  and  for  which  efficient  computational  methods 
of  solution  are  available.  It  follows  from  (2.2)  that  for  linear  f, 
f  =  Ax  +  Bu  +  q  ,  where  A  and  B  are  matrices  and  q  is  a  constant 
vector,  the  null  space  of  v  will  be  a  linear  manifold  (and  therefore 
convex).  Thus  linear  dynamics  and  a  convex  functional  lead  to  a  reason¬ 
ably  well -understood  convex  problem.  In  general,  however,  if  f  is  not 
linear,  the  set  S  will  not  be  convex.  Necessary  optimality  conditions 
will  be  given  for  S  nonconvex,  but  in  general,  conditions  which  are 
also  sufficient  are  not  known  for  such  problems.  Furthermore,  for  non¬ 
convex  S  a  problem  may  have  many  constrained  local  minima  even  with 
cp  linear.  Thus  even  if  a  method  finds  a  local  minimum  such  a  minimum  may  be 
far  from  the  desired  global  minimum. 

III.  We  will  now  consider  optimality  conditions  for  the  problem  (2.3), 

and  use  these  to  obtain  the  adjoint  equations  and  a  "minimum  principle" 

for  the  discrete  optimal  control  problem.  We  assume  that  <p  and  v  are 

in  C'( Z).  and  that  S  is  nonempty.  We  denote  by  v  the  Jacobian  matrix 

z 

of  a  function  v  ,  and  p  transpose  by  p'  so  that  p'v  denotes  an  inner 
product. 

Sufficiency  Iheprw 

Let  <p  be  convex  and  v  be  linear  on  Z  .  A  sufficient  condition  that 
z*  e  S  solves  (2.3)  is  that  there  exists  p  e  Emn  such  that 


Proof:  Since  <p  is  convex  and  v  linear,  the  function  f  -  +  p'v  is 

convex  on  Z  .  Then  for  every  z  €  Z,  y(z)  “  f(z*)  *  f£(z*)  [z-z*]  z  0, 
by  (3.1).  Since  z*  €  S,  v(z*)  =  0,  so  that  we  have 
<p(z)  -  q>(z*)  >  -p'v(z)  =  0,  V-  z  e  S  . 

Thus,  z*  is  a  global  minimum  on  S  . 

Necessity  Theorem 

Assume  that  the  compact,  convex  set  Z  has  interior  points.  Let 
7+  solve  (2.3).  Then  there  exists  a  scalar  (j,  s  0,  and  p  €  Emn,  not 
both  zero,  such  that 

#  (z*)(z-z*)  >  0,  Vz  €  S  (3.2) 

z 

where 

$(z)  =  pi«P(z)  +  p'v(z)  (3.3) 

The  proof  of  this  theorem  is  too  long  to  be  included  here,  and  is  given 

in  { 2] .  It  should  also  be  noted  that  if  an  appropriate  constraint  qualification 

is  satisfied  we  can  choose  p,  =  l . 

If  we  restrict  the  functions  qp  and  v  as  in  the  sufficiency  theorem, 
we  obtain  a  Minimum  Principle. 

Let  <p  be  convex  and  v  linear  on  Z,  and  let  z*  solve  (2.3). 

Then  there  exist  multipliers  p.  >  0,  and  p  ,  not  both  zero,  such  that 
<t>(z)  attains  its  minimum  over  z  €  S  at  z*  ,  where  <J>(z)  is  given  by  (3.3). 
The  prool  follows  immediately  from  the  convexity  of  <t>  and  (3.2)  . 


We  are  now  in  a  position  to  apply  these  results  directly  to  the 


n 


discrete  optimal  control  problem.  If  we  denote  by  p(  €  E  ,  the 

multipliers  corresponding  to  ,  we  obtain  from  (1.3)  and  (3.3). 

m-l  m-1 

*e»  ^  ui>  *  d  o  p‘i+1  t«V  V +  *,  - W  <3-«> 

We  will  let  f*  3  f  (xF,u*),  etc.,  and  p  =0.  Then  the  necessary 
xi  x  1  1  o 

condition  (3.2)  can  be  written 

lf,1iiXi  <3-5) 

1=0,  l,  . .  .m-l 


-p'  (x  -x*  )  >  0,  V  x  €  X 
r  m  '  m  m  m  m 


(3.6) 


*Vui  , 

1=0,  1,  . .  .m-i 


(3.7) 


If  we  assume  that  the  initial  state  is  specified  as  a  point,  l.e.,  X  =  x  • 

o  o 

then  x*  =  JTq  and  (3.5)  is  satisfied  for  1  =  0.  Now  suppose  further 
that  no  state  constraints  are  active  except  for  the  terminal  constraint,  i.e 
x*  c  int  Xj,  1  =  1,. .  .m-l .  Then  (3.5)  requires  that  the  expression  [  •] 
vanish.  That  is 


pV  -  z  p'j-j. i  ^ xi  ^  ^^^xi  ’  i  *  m-l ,  m-2,  ...0.  (3.8) 

The  multipliers  p1  are  therefore  determined  by  the  recursion  relation 

(3.8)  starting  with  a  vector  pm  satisfying  (3.6).  For  x*m  on  the 

boundary  of  X  ,  p  must  be  parallel  to  the  (outward)  normal  vector 
m  m 

to  a  supporting  hyperplane  at  x*  .  The  recursion  relation  (3.8)  Is  seen 

m 

to  be  a  finite  difference  approximation  to  the  adjoint  differential  equation 


8 


for  the  continuous  optimal  control  problem. 

Let  us  now  define 

H(x,  u,  p,  q)  =  p'f(x,  u)  +  (p-q )‘x  +  ^  <j(x,  u)  (3.10) 

By  rearranging  terms  we  obtain  from  (3.4) 

m-l 

♦(*)  =  2  H(x  ,  u  ,  p  p  )  -  p  x  (3.11) 

o  l  i  i+i  l  mm 

For  o  convex  and  f  linear  on  X,  x  Uj  ,  the  previous  minimum  principle 

*  * 

applies.  The  optimal  trajectory  (x^ )  and  control  fu^  }  must  therefore 
satisfy  the  following 


ete  Minimum  Principle 
If  [x*\  and  (Uj*)  are  optimal,  then 

H(x*.  u*  pi+l’  pi>  5  H(JV  ui*  pi+i'  pi)»  V- xL  e  c  UA  (3.12) 

i=0,  1,  . . .  .m-l 


where  the  adjoint  vectors  pA  are  determined  by  (3.6)  and  (3.8). 


r- 


IV.  In  order  to  describe  the  computational  solution  of  the  discrete  optimal 
control  problem  we  first  consider  a  linear  recursion  relation  with 
f  =  Ax  +  Bu,  and  also  give  a  more  explicit  statement  of  the  constraint 


sets  X  and  U  .  To  simplify  the  discussion  we  will  assume  that  X  =  x 


and  that  the  X^  and  are  polyhedral  sets  defined  by  the  linear  inequalities 


Xt  =  {  x  J  G'jX-5^0),  i  =  l,....m 
Uj  *  {  u  |  H’u-hi  o}  ,  1  =  0 . m-i 


(4.1) 


where  and  H  are  constant  matrices  and  y  and  h  are  constant 
vectors.  If  these  sets  are  Just  upper  and  lower  bounds  we  have 


9 


Gi  *  Un  -In)  and  H  *  (Ir  -If)  . 

The  problem  (1.1)  -  (1.3)  now  becomes 


r 


min 

xi'ui 


< 


m-1 

2  otx  .Uj) 
i*0 


V. 


xi+i  * 

Gixi  *  «i  k  °*  1  *  l» 
H'u.  -  hi  0,  i  *  <£ 

l  > 


i  •  0,  ...«-l  I 

>  (4.2) 

...a  | 


. .  .ra-1  J 


If  a  I*  convex,  this  is  a  convex  programming  problem  with  linear  *ouality 
and  inequality  constraints .  It  consists  of  m(n+r)  variables,  nm  equality 
constraints,  and  2m(n+r)  inequality  constraints  if  they  are  bounds.  This 
could  be  solved  directly,  but  will  be  a  large  problem  if  m  la  large.  A 
more  efficient  method  of  solution  is  to  use  the  linear  equations  to  eliminate 

V 

the  state  vectors  x^  and  map  the  entire  problem  into  the  control  space. 
This  is  done  by  means  of  the  following 


Ifmmi 

Let  xJ  satisfy  dte  recursion  relation 

xi«fl  *  *ixi  +  bi*  1  *  °» •••*“*  (4.3) 

with  xq  specified  and  nonsingular.  Then 

4  *»***,  vv  1  “••••“  <4-4> 

where  the  matrices  Yi  and  Ai  satisfy 

Yiel"Vi»  4 -0*  !••••*-»  (4.S) 

and 

A\  ■  A'1+i  Kj  .  AJ^  ■  ,  i  •■-1....1  (4.4) 
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If  we  apply  this  to  the  equality  constraints  with  =  I  +  A  ,  and 
bj  =  BuJ  we  obtain 

i  -  1 

xi  =  Yixo  +  2  ^'iJUj  ’  i  =  (4.7) 

where  Q'^  =  imPortant  Point  to  note  is  that  given  the 

matrices  A  and  8  .  we  can  explicitly  compute  the  matrices  Y^  and 
by  operations  with  (nxn)  matrices .  Since  n  (the  dimensionality  of  the 
state  vector)  is  usually  small  compared  to  m  ,  this  is  an  efficient 
computation. 

We  now  use  (4.7)  to  eliminate  the  x^  from  the  state  constraints 
and  o  lo  (4.2).  The  state  constraints  can  then  be  written 


i-t 

X  D1  u  -  d  a  0,  1*  l.M.m  (4.8) 

j=  0  1J  1  1 


where  D'  =  G'  0!.  and  d  =  <J\  -  G'  Y,  x  .  A  similar  substitution  in 
ij  i  ij  i  i  i  i  o 


gives  an  objective  function  p(u)  depending  on  the  controls  u-  (u  ), 
only.  If  a  is  convex.  p(u)  will  also  be  convex  because  of  the  linearity 
of  (4.7).  The  problem  (4.2)  has  therefore  been  reduced  to 


min 

U1 


p(u) 


i-t 

I  D'  u  a  d  .  i  *  l,  . .  .m 
j  =  0  J 

ffu  ah,  i  =  0,  . .  .m-t 


(4.9) 


v  y 

This  reduced  problem  has  rar  variables,  no  equalities,  and  the  same 

number  of  inequalities  as  (4.2).  In  most  control  problems  r  <  n,  so  that 
we  have  cut  the  problem  size  at  least  in  half.  Since  there  are  more  inequality 
constraints  than  voriables.  and  no  nonnegativity  requirements,  this  is  best 


1 1 

solved  by  a  convex  method  in  the  dual  apace  (such  as  gradient  projection). 
Computational  efficiency  is  also  improved  by  taking  advantage  of  the 
upper  triangular  structure  of  the  first  2mn  constraints  and  the  block 
diagonal  structure  of  the  remaining  2mr  constraints. 

In  the  important  case  where  c  Is  linear,  (4.9)  should  be  considered 
as  the  unsymmetric  dual  problem.  The  equivalent  primal,  with  mr  rows 
is  then  efficiently  solved  by  any  standard  LP  routine,  which  of  course 
gives  the  desired  dual  variables  (the  controls)  as  the  elements  of  the 
pricing  vector.  For  a  more  complete  discussion  of  the  linear  recursion 
relation  problem,  see  f  t  j . 

V.  The  method  of  the  previous  section  can  only  be  applied  directly  when 
the  system  dynamics  are  described  by  a  linear  recursion  relation.  However, 
with  appropriate  convexity  requirements  on  f  ,  the  nonlinear  problem  can 
i>o  solved  by  an  iterative  solution  of  linearised  problems.  At  each  Iteration 
function  f  is  linearized  about  the  previous  state  and  control,  and  a 
:r,<'ar'z.‘d  pronlem  ot  the  kind  discussed  above  is  solved.  This  method  is 


tuily  described  in  (  3  j  „ 


12 


REFERENCES 


J.  B.  Rosen,  "Optimal  control  and  convex  programming1 ,  Froc. 

IBM  Symp.  on  Control  Theory  and  Appl.,  Yorktown  Heights, 

N.Y.  (Oct.  1964)  py.  223-237  . 

O.  L.  Mangasarian,  Nonlinear  Programming.  McGraw  Hill 
(to  be  published  in  1968),  Chap.  U. 

J.  B.  Rosen,  "Iterative  solution  of  nonlinear  optimal  control  problems", 
J.  SIAM  Control  4  (1966),  pp.  223-244. 


COMPLEMENTARY  SLACKNESS  IN  DUAL  LINEAR  3YSTT3-E 


The  first  part  of  this  paper  develops  a  convenient  technique  for  working  with  a 
pair  of  dual  (i.e.,  complementary  orthogonal)  linear  subspaces,  E  and  X,  in  a 
linear  space  of  n- tuples  (from  an  ordered  field).  The  second  part  deals  with  the 
fundamental  existence  theorem  that  the  dual  subspaces  E  and  X  must  contain  n- 


tuples  5  and 
i  =  l,...,n. 

x  such  that  Jj 
Applications  of  tr 
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2=3  and  ^=1/  5=4) 
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*0 

*2 

-1/4  -6/4 

*”X2 

>v2 

6  1 

-9 

»o 

63 

2/4  0 

a-X 

3 

=  41 

H2  “*3 

“*4 

( standard  form) 


(canonical  form) 


The  above  ^-schema  exhibits  two  dual  systems  of  homogeneous  linear  equations 
in  (Baatzig’s)  standard  form."  The  "greek”  system  XA  "  S  determines  the  row- 
space  of  the  matrix  A,  i.e.,  the  linear  aubspace  H  of  all  row -vectors  £  ex¬ 
pressible,  via  parameters  as  linear  combinations  of  the  rows  of  A,  The  "latin 

oystea  Ax  *  0  determines  the  orthogonal  complement  in  n-space  of  the  row-space  of 
a,  i.e.,  the  linear  subspace  X  of  all  columa-vectors  x  orthogonal  to  each  row 
of  A  —  and  therefore  to  each  £  of  E,  since  £x  =  (xA)x  -  \(Ax)  -  0.  Let  rank 
A  *  r;  then  din  E  *  r  and  dim  X  -  n-r  *  s,  (Any  matrix  X  that  is  row-equiva¬ 
lent  to  A  (including  insertion  or  deletion  of  rows  of  zeros)  yields  an  X-scheoa, 
with  its  own  parametric  £,  which  determines  the  same  dual  subspaces  E  and  X.] 

Nov  partition  Ax  •  0  into  Ax  +  Ax  ■  G,  where  the  subaatrix  A  consists  of 
r  linearly  independent  columns  of  A.  (There  are  at  most  n.'/r.'ej  such  parti¬ 
tions.)  Since  A  provides  a  basis  £or  columns  of  A,  there  exists  an  r  by  e 
matrix  that  the  su^matrix  A  «  AM,  Gauss-Jordan  (complete)  eliainatiOR 

jeducj«p  Ax  +  Ac.  *  0  to  lx  +  Mk’  *  0,  i.e.,  HX  •  -x,  and  reduces  AA  «  t  bo 
i'A  *  T5 is  above  M-schema  exhibits  these  reduced  aysteas,  which  are  dual  linear 

systems  in  (Dantzis*s)JVanonlcal  fora";  note  that  r,...,r  index  .Vie  (basic)  col¬ 
umns  c  A  and  that  r+l,..,,n  index  the  (non-btiaic)  columns  of  A, 


Hvot  v”.  any  Rente ro  entry  n  to  exchange 


i 


*1 

follows: 


'pq  - - ^  *p  vith  Xr^  v 

(but  with  uo  other  marginal  changes)  to  get  an  R-schena  with  entries  as 


with 


B 


aij 


-  n.  _  a  ,/a  „ 

iq  pj'  pq 


fr  ** ,  1^0pq'  apJ  *  “iq  iq'-pq’  ~ij  "  “ij  yj 

Ucr  each  i  f  p  and  each  4  j  q).  Note  the  sign  change  from  a,  to  I  if 

m  ^  ft  _ _  ~  .  -  -  _  I  _ 

pq 


mm 

>  0,  and  from  to  if  <  0  (as  exemplified  below).  By  a  finite 


2. 


succession  of  such  pivot  steps  (and  rearrangements  of  rows  and/or  columns)  we  can 
pass  from  any  M- schema  for  E  and  X  to  any  other  (in  the  "combinatorial 
equivalence"  class).  For  example, 


*1 

x4 

-1/4 

~TJh 

2/4 

0 

a  S  ^ 


The  double  errows  indicate  pivot  steps  (in  either  direction)  and  appropriate  rear¬ 
rangement.  There  are  just  five  M- schemata  in  this  case  (not  kl/2121  =  6) 
because  the  2nd  and  4th  columns  of  the  initial  matrix  A  are  linearly  depen¬ 
dent. 

*  *  * 

We  now  turn  our  attention  to  the  (polyhedral)  cones  E  end  X*  in  which  the 
dual  subspaces  E  and  X  intersect  the  (closed)  orthant  of  all  nonnegative  n- 
tuples.  Borrowing  a  convenient  term  from  .Linear  Programming,  we  say  that  any  t 
in  E*  or  any  x  in  X*  is  feasible ;  i.e.,  |  is  feasible  if  AA  =  6  >  0  for 
some  K,  and  x  is  feasible  if  Ax  =  0  and  x  >  0.  Note  that  |x  =  0  “implies, 

for  any  feasible  £  and  any  feasible  x,  that  Lx,  -  0,  i.e.,  =  0  or 

Xj,  =  0  (or  both),  for  i  ■  l,.,.,n.  x  x 

,  ..We  say  that  a  feasible  £  or  x  is  basic  if,  for  any  partition  of  A  into 

A,  A  (as  specified  above),  exactly  one  component  of  |  or  x  is  positive  and  if 
|  or  x  is  normalized  so  that  the  sum  of  the  components  is  one.  Hence  a  basic 
feasible  §  or  x  corresponds  tc  a  nonnegative  row  or  nonpositive  column  of 
some  M-schema: 


where  +,  ©,  0,  9,  -  denote  numbers  that  are  positive,  nonnegative,  zero,  non¬ 
positive,  negative,  respectively.  In  the  above  example  £  =  (l/3,0, 2/3,0)  and 
x  =  (0, 3/5, 0,2/5)  are  basic  feasible  4-tuples.  Clearly  the  set  of  basic  feas¬ 
ible  £'s  (or  x's)  is  finite,  since  the  class  of  M-schemata  is  finite.  [The 
reason  for  the  term  'basic"  in  this  context  is  that  it  can  be  shown  that  the  basic 
feasible  g's  or  x's  determine  the  "extreme  rays"  of  the  cone  E*  or  X*; 
i.e.,  that  any  feasible  £  or  x  is  expressible  as  a  nonnegative  linear  combina¬ 
tion  of  the  basic  feasible  £'s  or  x's.] 


f 


L2M4 A.  21  tiier  there  is  an  all -positive  (feasible)  £  or  there  ia  a  basic 
feasible  x. 

Proof,  Let  L(r,s)  assert  this  Leona  when  din  E  *  r  and  dim  X  *  is.  We 
use  induction  on  r+s  *  n.  L(l,s)  and  L(r,l)  are  trivial  to  prove. 

We  prove  L(r,s)  for  r  >1  and  a  >  1,  assuming  L(r-1,  s)  and  L(r,  s-i). 
Using  L(r-1,  s)  for  the  first  r-1  rows,  we  get 

_  +  0  ...  0 

+ 1  t  i  r®  I**-© 


In  the  left  alternative,  perturb  0  to  a  small  e  >  0  to  get  an  all-positive  £, 
In  the  right  alternative,  there  is  a  basic  feasible  x  showing,  unless  the  corner 
entry  marked  ?  is  positive.  If  this  corner  entry  is  positive,  pivot  on  it  and 
then  use  L(r,  s-l)  on  the  last  s-1  columns  to  get 


+  G 
+  + 


last 

—  a-l  — 
columns 


In  the  left  si  ter native,  there  is  an  all-positive  £,  and  in  the  right  alternative 
a  basic  feasible  x.  L(r,s)  is  now  inductively  established  for  all  r  and  s. 

The  above  Lemma  is  essentially  "the  theorem  of  the  alternative  for  matrices" 
used  by  J.  von  Neumann  and  0.  Margenstern  to  prove  the  Main  ("MLnimax")  Theorem  in 
their  THEORY  OF  GAMES  AND  ECONOMIC  BEHAVIOR.  There  is  a  dual  Lemma  with  £  and  x 
interchanged  (i.e.,  using  -M ,  in  place  of  M).  The  two  Lemmas  combine  to  show 
that  there  is  either  a  basic  feasible  1  ora  basic  feasible  x  (or  both), 

COMPLEMENTARY  SLACKNESS  THEOREM.  There  exists  a  feasible  £  and  a  feasible 
x  such  that  £  +  xT  >  0  (in  all  components). 

In  the  example  above,  £  *  (l,0,3,0)  and  xT  *  (0, 3,0,2)  yield 
£  +  xT  *  (1,3, 3,2)  >  0. 

Proof  (by  T.  D.  Parsons).  Let  T(s)  assert  this  Theorem  when  dim  X  »  s. 

We  use  induction  on  s«  T(l)  is  trivial  to  prove. 

We  prove  T(s)  for  s  >  1,  assuming  T(s-l).  Apply  L(r,s)  to  get 


In  the  left  alternative,  taking  x 
alternative,  we  take  the  two  cases 


4 


+  ©  © 


=-(+) 


In  the  left  case,  we  get  |  «  0  and  x  >  0  "by  taking  the  first  +  at  the  top 
large  enough,  after  the  remaining  +'s  at  the  top  are  taken  arbitrarily.  In  the 
right  case,  we  use  T(s-l)  opposite  the  0's  in  the  first  column  and  then  make 
the  first  +  at  the  top  large  enough.  Thus,  as  indicated,  we  get  {  +  x^  >  0 
because  of  T(s-l)  for  the  components  ©.  T(s)  is  now  inductively  established 
for  all  s. 

This  Theorem  is  Theorem  1  (and  3)  in  Paper  1  by  A.  W.  Tucker  in  LINEAR  IN¬ 
EQUALITIES  &  RELATED  SYSTB6,  ed.  by  H.  W.  Kuhn  &  A.  U.  Tucker  (Princeton,  1956) 
and,  with  a  geometric  proof,  "Key  Theorem"  in  R.  A.  Good,  "Systems  of  Linear  Rela¬ 
tions,"  SIAM  REVIEW  1  (1959)  1-31«  Complementary  slackness  refers  to  the  existence 
of  feasible  £  and  x  with  "slack"  f>  b),  for  each  column  of  A,  in  either 
or  x.  (but  not  both).  The  following  axe  Corollaries  (see  papers  cited  above  for 
references ) : 

i.  (Gordon,  1673)  Ax  -  0  has  a  solution  x  >  0  and  $=  0  iff  U  >  0  for 
no  2.  (Stiemke,  1915)  Ax  »  0  has  an  all -positive  solution  x  iff  \A  >  0 

and  if  0  for  no  3.  (Farkas,  1902)  b\  >  0  for  all  k  such  that  kA  >  0 

iff  Ax  <=  b  for  some  x  >  0.  4.  cx  >  0  for  all  x  >  0  such  that  Ax  *  0  iff 

^ A  +  c  >  0  for  some  k.  *  2'  If  M  is  skew^- symmetric  (i.e.,  !r  *=  -M),  then 
\u  =  X  "has  a  solution  £  >  0  such  that  £  +  g  >  0, 

Corollary  2  (“  Theorem  5  in  Paper  1  cited  above)  can  be  used  to  establish  the 
duality  and  existence  theorems  of  Linear  Programming  (see  A.  J.  Goldman  and  A.  W. 
Tucker,  "Theory  of  Linear  Programming,"  LINEAR  INEQUALITIES  &  RELATED  SYSTEMS,  pp. 
53-62,  and  R.  A.  Good's  paper  cited  above,  pp.  17-21). 


O.N.R.  Logistics  Project,  Princeton  University,  July  1967 


A.  W,  Tucker 


FINDING  TEE  POINT  OF  A  CONVEX  POLYHEDRON  NEAREST  A  GIVEN  POINT 

Problem:  To  minimize  half  the  square  of  the  distance  from  the  point  (2, 3,-1) 
to  tES~f solid )  tetrahedron  determined  by 

w  =  -x  -  y  -  z  +  3  >  0,  x  >  0,  y  >  0,  z  >  0  • 

The  "objective  function"  to  be  minimized  subject  to  these  constraints  is 

f(x,y, z)  =  |»(x-2)2  +  (y-3)2  +  (z+l)2]  =  7  -  2x  -  3y  +  z  +  ~(x2+y2+z2)  . 

A  necessary  and  sufficient  condition  that  a  point  (::,y,z)  of  the  tetrahedron 
yield  the  required  minimum  is  that  the  gradient  of  f  at  the  point  (x,y, z)  be 
expressible  as  a  nonnegative  linear  combination  of  the  inward  normals  to  the  con¬ 
straints  that  are  "active"  at  the  point  (x,y, z).  That  is, 


x-2 

Vf  =  y-3 


-r  fi1  ~oi  fo 

=  u)-l  +  £  0  +  q  1  +  i,  0 

-ij  L°J  l°]  U 


where  i ,  q,  £  are  each  nonnegative  and  must  be  zero  if  the  corresponding  con¬ 
straint  is  "slack"  (>  0)  at  the  point  (x,y,z).  Hence 

to>0,  5=oj  +  x-  2>0,  +  {,  «=co  +  z  +  l>0 

and 

yw  +  £x  +  qy  +  £z  =  0  . 

Another  way  of  getting  the  same  information  is  to  form  the  Lagrangian  function 

4  ”  f  -  ww  -  $x  -  qy  -  £z 

with  nonnegat-ive  Lagrange  multipliers  u,  £,  q,  £  and  take  ^  *  0  with  /  *  f 
(i.e,,  gjw  +  {■x'V  qy  +  £z  *  0).  Substituting  for  f,  w,  £,  q,  £  from  above,  we  get 

12  2° 

jrf(x,y,z)  «*  7  -  2x  -  3y  +  z  +  ~[x  +y  +z*~3  -  u(3-x-y-z)  -  (u+x-2)x  -  (urf-y-3)y  -  (uH-z+l)z 


3  7  -  3o  -  |-[x2+y2+z2]. 


Hence 


f  +  4  *  -3w  -  2x  -  3y  +  z  +  l1*  • 

The  above  linear  equations  for  w,  £,  q,  £,  a r.l  f  +  4  8X8  exhibited  most  con¬ 


veniently  in  the  following  schema: 
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In  addition  w,x,y,z  and  u, £,q,£  are  all  to  be  nonnegative  and 

f-^»uw  +  £x  +  qy  +  £z«*0 

We  seek,  therefore,  a  solution  of  the  linear  system  in  the  above  schema  that  is 
nonnegative  in  all  eight  variables  (excluding  f+4)  and  is  such  that  u  *  0  or /and 
v  «  0,  %  “  0  or/and  x  *  0,  q  •  0  or/and  y  ■  0,  (,  ■  0  or/and  z  »  0.  Pt>r  this 
purpose  we  turn  to  schemata,  such  as  the  following,  that  are  equivalent  to  the  above 
schema  under  principal  pivoting : 


2. 
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We  get  the  first  schema  above ,  which  we  dub  the  yz-schema  (in  terms  of  the  latin 
"inputs"  y,z  at  its  left  margin),  from  the  original  xyz-schema  by  pivoting  on  its 
starred  principal  (diagonal)  entry.  Pivoting  on  the  starred  principal  entry  of  the 
yz-schema  we  get  the  z-schema  above,  and  then  by  its  starred  pivot  the  null-schema 
above.  Also,  by  pivoting  on  the  principal  entry  2  in  the  z-schema,  and  then  rear¬ 
ranging,  we  get  the  w z-schema  above. 

A  solution  to  our  minimum-distance  problem  can  be  read  from  the  wz-schema:  set 
the  inputs  |  =  0,  q  =  0  and  w  =  0,  z  =  0  to  get  x  =  1,  y  **  2  and  u  =  1,  £  =  2 
and  f+^  ■  3.  Also  f-^  «  cjw  +  |x  +  qy  +  £z  =  0,  so  f  =  /  =  3/2.  Hie  point 
(1,2,0),  on  the  edge  w  =  0,  z  =  0  of  the  tetrahedron,  is  the  point  of  the  tetra¬ 
hedron  nearest  to  the  given  point  (2,3, -l).  Half  the  square  of  this  minimal  dis¬ 
tance  is  3/2. 

In  the  z-schema  set  the  inputs  u=0,  £=0,  n=0  and  z  =  0  to  get  w  =  -2, 
x  “  2,  y  =  3  and  £  =  1  and  f+^  =  1.  Here  again  f-p=  uw  +  £x  +  qy  +  £z  =  0.  The 
point  (2,3,0)  is  the  foot  of  the  perpendicular  from  (2, 3,-1)  to  the -plane  z  =  0 
bounding  the  tetrahedron  but  w  *  -2  shows  that  the  point  (2,3>-l)  lies  on  the 
wrong  side  of  the  plane  w  =  0.  Also,  in  the  yz-schema  set  the  inputs  u>  =  0, 
i  -  0  and  y  =  0,  z  =  0  to  get  w  =  1,  x  =  2  and  q  =  -3,  £  =  1  and  f+/  =  10. 

Again  =  uiw  +  £x  +  qy  +  £z  =  0.  The  point  (2,0,0)  is  on  the  y  =  0,  z  =  0  edge 
of  the  tetrahedron  and  is  the  foot  of  the  perpendicular  on  this  edge  from  (2,3, -l) 
but  the  gradient  Vf  at  (2,0,0)  is  unfavorably  directed  due  to  q  =  -3. 


TCiis  diagram  depicts  the  full  equivalence-class  of  15(*=  2^-1 )  schemata  [there 
is  no  wxyz-schena,  for  otherwise  w=x=y*z=0  would  be  part  of  a  solution). 
Each  node  represents  a  schema  and  each  arc  a  principal  pivot  step  (in  either  direc¬ 
tion).  Arrows  mark  the  four  pivot  steps  used  above  to  get  from  the  xyz-schema  to 
the  yz-,  z-,  null-,  and  vz-schemata.  At  the  same  time  the  diagram  depicts  the 
combinatorial  (incidence)  structure  of  the  constraint  tetrahedron.' 
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DUAL  QUADRATIC  PROGRAMS 

We  seek  to  minimize  f  for  nonnegative  x  «  (x,,...,x  )^,  the  "Latin 

problem,"  and  to  maximize  4  for  nonnegative  5  =  the  "Greek 

problem, "  subject  to  the  quadratic  equation 

f  -  4  ■  fr 

and  to  any  one  of  a  finite  class  of  equivalent  systems  of  linear  equations,  each 
given  by  a  schema 


*1 
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where  l,...,m+n  denotes  a  permutation  of  the  indices  l,...,m+n.  P  and  Q  are 
positive  semidefinite  symmetric  square  submatrices.  Note  the  blsymmetry:  the 
upper  left  and  lower  right  "quarters"  are  symmetric,  while  the  lower  left  is  the 
negative  transpose  of  the  upper  right. 

With  each  such  schema  there  goes  a  pair  of  quadratic  programs  dual  in  the 
sense  of  Cottle: 

(G)  maximize  ?  =  d  +  gb  -  ^  gPg  -  |  x  Qx  for  g  >  0,  g«gA  +  xQ  +  c>0; 

(L)  minimize  f  =  d  +  ex  +  |  x^Qx  +  |  £p£T  for  x  >  0,  x  •  PgT  -  Ax  -  b  >  0  . 

A  value  of  4  (or  f)  is  termed  feasible  if  it  arises  from  a  solution  g,  x  of 
the  schema  having  g  >  0  (or  x  >  0),  For  any  two  solutions  g,  x  and  g',  x' 

*'  -  4  >  l(?i '  ^-Ax'-b)  +  ( gA+x^Q+c )x '  «  gx' 

since  (g'-g)P(g'-g)T  >  0  and  (x'-x)TQ(x'-x)  >  0.  If  g>0  and  x'  >  0, 
then  f '  >  4'  That  is,  each  feasible  f  is  an  upper  bound  for  all  feasible  4 
and  each  feasible  ^  is  a  lower  bound  for  all  feasible  f.  Hence  a  solution  of 
the  above  schema  such  that  g  >  0,  x  >  0  and  gx  ■  C  is  optimal  for  both 
programs.  In  this  case  f  =  ^  *  \{i+<4). 


4, 


•  / 

* 

The  class  of  schemata  equivalent  to  the  above  is  generated  by  three  types 
of  pivotal  exchange:  (l)  pivoting  on  a  diagonal  entry  p  j-  0,  which  decreases 
m  by  one  and  increases  n  by  one;  (2)  pivoting  on  a  diagonal  entry  q^  ^  0, 
which  decreases  n  by  *ne  and  increases  m  by  one;  (3)  pivoting  on  a  nonzero 
skew  pair  a,  ^  and  -a^,  which  does  not  change  m  and  n.  Under  such  pivot¬ 
ing  the  schematic  bisymmetry  and  the  positive  semidefiniteness  of  the  symmetric 
square  submatrices  P  and  Q  are  preserved.  Moreover,  the  submatrices  P  and 

Q  have  constant  nullities  m  =  m-rank  P  and  n  =  n-rank  Q.  Within  this 
-  o  o 

(finite)  class  of  equivalent  schemata  there  must  exist  at  least  one  schema  with 
m  =  mQ,  i.e„,  with  P  =  0  or  with  P,  A,  b  vacuous  if  m  =0,  yielding  a 
"pure"  Latin  program: 

(L  )  minimize  f  =  d  +  cx  +  —  x^Qx  for  x  >  0,  -x  =  Ax  +  b  <  0  . 

O  t  =  = 

This  is  the  classic  type  of  convex  quadratic  program  for  which  Dorn  (and  also 
Dennis)  introduced  duality.  Also,  of  course,  there  is  at  least  one  schema  with 
n  =  nQ,  yielding  a  dual  "pure"  Greek  program: 

(Gq)  maximize  <j>  =  d  +  £b  -  £P|T  for  £  >  0,  V  =  |A  +  c  >  0  . 

The  tw*  programs  become  ordinary  dual  linear  programs  (in  Dantzig's  canonical 

form )  if  m  =  m  and  n  =  n,  so  that  P  =  0  and  Q  —  0. 
o  o 

Recent  work  of  Dantzig  and  Cottle,  of  Lemke,  and  of  Parsons,  shows  that 
within  the  (finite)  class  «f  equivalent  schemata  there  must  exist  a  schema  with 
an  obvious  solution 

i  =  0,  x  =  0,  x  =  -b  >  0,  i  =  c  >  0,  f  =  4  =  d 
•r  a  schema  with  an  obvious  infeasibility  (viz.,  a  nonpositive  column  with  its 
bottom  entry  negative ) . 

NOTE.  The  author  gratefully  acknowledges  the  collaboration  of  Dr.  T.  D. 
Parsons,  Princeton  Uhiversity,  and  of  Dr.  Philip  Wolfe,  IBM  Research  Center. 
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1_. _ Formulation.  Linear  programming,  quadratic  programming,  and 


bimatrix  (two-person,  non  zero-sum)  games  lead  to  the  consideration 


of  the  following  Fundamental  Problem:  Given  a  real  p-vector  q 


and  a  real  p  x  p  matrix  M,  find  vectors  w  and  z  which 

* 

satisfy  the  conditions 


(1) 

w  ■  q  +  Mz , 

tf  >  0,  2  _>  0 

(2) 

o 

1 

> 

N 

The  remainder  of  this  section  ia  devoted  to  an  explanation  of  why 
this  is  so.  (There  are  other  fields  in  which  this  fundamental 
problem  arises  —  see  for  example  [6]  and  [13]  —  but  we  do  not  treat 
them  here.)  Sections  2  and  3  are  concerned  with  constructive 
procedures  for  solving  the  fundamental  problem  under  various  assumptions 
on  the  data  q  and  M. 


* 

In  general,  capital  roman  letters  denote  matrices  while  vectors  are 
denoted  by  lower  case  roman  letters.  Whether  a  vector  is  a  row  or 
a  column  will  always  be  clear  from  the  context,  and  consequently  we 
dispense  with  transpose  signs  on  vectors.  In  (2),  for  example, 
zw  represents  the  scalar  product  of  z(row)  and  w(column).  The 
superscript  indicates  the  transpose  of  the  matrix  to  which  it  is 
affixed. 
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I 

i 


Consider  first  linear  programs  in  the  symmetric  primal-dual  form 
due  to  J.  von  Neumann  (20}. 

Pr<«*l  linear  program:  Find  a  vector  x  and  minimum  2  such  that 

(3)  Ax  £  b,  x  £  0,  z  -  cx 

Dual  linear  program:  Find  a  vector  y  and  maximum  £  such  that 

(4)  yA  <  c,  y  1  0,  2.  -  yb 

The  duality  theorem  of  linear  programming  [3)  states  that 
min  2  «  max  £  when  the  primal  and  dual  systems  (3)  and  (4), 
respectively,  are  consistent  or  —  in  mathematical  programming 
parlance  —  "feasible."  Since 

£  “  yb  £  yAx  £  cx  *  z 

for  all  primal-feasible  x  and  dual-feasible  y,  one  seeks  such 
solutions  for  which 

(5)  yb  -  cx 

The  inequality  constraints  of  the  primal  and  dual  problems 
can  be  converted  to  equivalent  systems  of  equctions  in  non-negative 
variables  through  the  introduction  of  non-negative  "slack"  variables. 
Jointly,  the  systems  (3)  and  (4)  are  equivalent  to 

(6)  Ax  -  v  •  b,  v  0,  x  £  0 

T 

Ay+u-c,  u  £  0 ,  y  £  0 


md  the  linear  programming  problem  becomes  one  of  finding  vector a 
u,  v,  *,  y  auch  that 


n 

f  C1 

>  0 
/ 

-*  \ 

1  * 

i 

M 

f 

♦ 

vJ 

i-b  ) 

\  A 

0  / 

'  y  1 

and  (  by  (5)  ) 


u  >  0,  v  0 
x  2.  °»  y  2.  0 


(8) 


xu  +  yv  •  0 


The  definitions 


establish  the  correspondence  between  (1),  (2)  and  (3),  (4). 

The  quadratic  prc-armaning  problem  is  typically  stated  in  the 
following  Eacnar:  Find  a  vector  x  and  minimum  t  such  that 


(10)  A*  ^  b  ,  x  _>  0  ,  *  •  cx  ♦  -yuDx 

In  this  formulation,  the  matrix  D  may  be  assumed  to  be  symmetric. 

The  minima nd  t  is  a  globally  coovax  function  of  x  if  and  only  if 
the  quadratic  fora  xDx  (or  matrix  D)  is  positiva  smmi-def inlta, 
and  when  thie  la  the  case,  (10)  la  callad  the  ccr”*x  quadratic 
programing  problem.  It  ia  lamed  late  that  whan  D  is  the  sero  matrix, 
(10)  reduces  to  the  linear  program  (3).  In  this  sanse,  tha  llnsar 
programming  problem  le  e  epeclel  case  of  the  quadratic  programing 
problem. 
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For  any  quadratic  programming  problem  (10),  define  u  and  v 
by 

T 

(11)  u-Dx-Ay+c,  v  *  Ax  -  b 

A  vector  x°  yields  minimum  z  only  if  there  exists  a  vector  y° 
and  vectors  u°,  v°  given  by  (11)  for  x  -  x°  satisfying 

(12)  x°  _>  0,  u°  _>  0,  y°  _>  0,  v°  >_  0 

o  o  .  oo. 

x  u  =«  0  ,  y  v  »  0 

These  necessary  conditions  for  a  minimum  in  (10)  are  a  direct 
consequence  of  a  theorem  of  H.  W.  Kuhn  and  A.  W.  Tucker  [14].  It  is 
well  known  —  and  not  difficult  to  prove  from  first  principles  ~ 
that  (12),  known  as  :he  Kuhn-Tucker  conditions,  are  also  sufficient 
In  the  case  of  convex  quadratic  programming.  By  direct  substitution, 
we  have  for  any  feasible  vector  x, 

-o  ,  o.  1  1  o_  o 

z-z  ■  c(x  -  x  )  +■  -j-xDx  -  — 2~ x  Dx 

“  u°(x  -  x°)  +  y°(v  -  v°)  +  -|-(x  ■  x°)D(x  -  x°) 

-  u°x  +  y°v  +  -|-(x  -  x°)D(x  -  x°)  ^  0 

which  proves  the  sufficiency  of  conditions  (12)  for  a  minimum  in  the 
convex  case. 

Thus,  the  problem  of  solving  a  quadratic  program  leads  to  a 
search  for  solution  of  the  system 
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(13) 


T 

u»Dx-Ay+c  x  >  0,  y  >  0 

v  -  Ax  -b  u  0,  v  _>  0 


(14)  xu  +  yv  ■  0 

The  definitions 


establish  (13),  (14)  as  a  problem  of  the  form  (1),  (2). 

Dual  of  a  convex  quadratic  program.  From  (15)  one  is  led  naturally 
to  the  consideration  of  a  matrix  M  ■  ^  j  wherein  E  ,  like  D  , 

is  positive  semi~definite.  It  is  shown  in  [1]  that  the 
Primal  quadratic  program:  Find  x  and  minimum  z  such  that 

(16)  Ax  +  Ey  j>  b,  x  0,  z  •  cx  +  -|-(xDx  +  yEy) 
has  the  associated 

Dual  quadratic  program:  Find  y  and  maximum  z,  such  that 

T  1 

(17)  -Dx  +  A  y  <_  c  ,  y  >  0  ,  z_  ■  by - —  (xDx  -  yEy) 

All  the  results  of  duality  in  linear  programming  extend  to  these 
problems,  and  indeed  they  are  jointly  solvable  if  either  is  solvable. 
When  E  *  0,  the  primal  problem  is  just  (10)  for  which  W,  S.  Dorn 
[5]  first  established  the  duality  theory  later  extended  in  (1).  When 
both  D  and  E  are  zero  matrices,  this  dual  pair  (16),  (17) 

reduces  to  the  dual  pair  of  linear  programs  (3),  (4). 
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REMARKS,  (a)  The  minlmand  In  (10)  Is  strictly  convex  if  and  only 

r 

if  the  quadratic  form  xDx  is  positive  definite.  Any  feasible  strictly 
convex  quadratic  program  has  a  unique  minimizing  solution  x°. 

(b)  When  D  and  E  are  positive  semi-definite  (the  case  of 
convex  quadratic  programming),  so  is 


A  bimatrix  (or  two-person  nonzero-sum)  game.  P(A,B),  is  given  by 
a  pair  of  mxn  matrices  A  and  B  .  One  party,  called  the  row 
player .  has  m  pure  strategies  which  are  identified  with  the  rows 
of  A  .  The  other  party,  called  the  column  player,  has  n  pure 
strategies  which  correspond  to  the  columns  of  B  .  If  the  row  player 
uses  his  1th  pure  strategy  and  the  column  player  uses  his  1th 
pure  strategy,  then  their  respective  losses  are  defined  as  a^  and 
b^  ,  respectively.  Using  mixed  strategies 

m 

x  -  (x. ,..  .x  )  2.  0,  l  x  -  1 
1  °  i-1  1 

n 

y  -  <Vi . yJ  i  I  y^  -  1 

j.i  j 

their  expected  losses  are  xAy  and  xBy,  respectively.  (A  component 
in  a  mixed  strategy  is  interpreted  as  the  probability  with  which  the 
player  uses  the  corresponding  pure  strategy.) 

A  pair  (x°,y°)  of  mixed  strategies  is  a  Nash  [19]  equilibrium 
point  of  T(A,B)  if 
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x°Ay°  <,  xAy°  all  mixed  strategies  x 

x°By°  <_  x°By  all  mixed  strategies  y 

It  is  evident  (see  for  example  [15])  that  if  (x°,y°)  is  an 
equilibrium  point  of  r(A,B),  then  it  is  also  an  equilibrium  point 
for  the  game  T(A'  B')  in  which 

A'  -  [a  +  K]  ,  B'  -  [b±i  +  L] 

where  K  And  L  are  arbitrary  scalars.  Hence  there  is  no  loss  of 
generality  in  assuming  that  A  >  0  and  B  >  0,  and  we  shall  make  this 
assumption  hereafter. 

Next,  by  letting  efc  denote  the  k-vector  all  of  whose  components 
are  unity,  it  is  easily  shown  that  (x°,y°)  is  an  equilibrium  point 
of  r(A,B)  if  and  only  if 

(18)  (x°Ay°)e  ,<  Ay°  (A  >  0) 

m 

(19)  (x°By°)en  <  BTx°  (B  >  0) 

This  characterization  of  an  equilibrium  point  leads  to  a  theorem 

which  relates  the  equilibrium-point  problem  to  a  system  of  the  form 

....  *  *  *  * 

(1),  (2).  For  A  >  0  and  B  >  0,  if  u  ,v  ,x  ,y  is  a  solution 

of  the  system 

(20)  u  •  Ay  -  e  u  >  0,  y  >  0 

III 

T 

v  -  B  x  -  #  v  i  0,  X  >,  o 
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Notice  that  the  assumption  A>0,  B  >  0  precludes  the  possibility 
of  the  matrix  M  above  belonging  to  the  positive  semi-definite  class. 

The  existence  of  an  equilibrium  point  for  r(A,B)  was 
established  by  J.  Nash  [19]  whose  proof  employs  the  Brouwer  Fixed- 
Point  Theorem.  Recently,  an  elementary  constructive  proof  was 
discovered  by  C.  E.  Lemke  and  J.  T.  Howson,  Jr.  [15]. 

2.  Lemke' s  iterative  solution  of  the  fundamental  problem.  This 
section  is  concerned  with  the  iterative  technique  of  Lemke  and  Howson 
for  finding  equilibrium  points  of  bimatrix  games  which  was  later 
extended  by  Lemke  to  the  fundamental  problem  (1),  (2).  We  introduce 
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first  some  terminology  common  to  the  subject  of  this  section  s nd  the 
next.  Consider  the  system  of  linear  equations 

(22)  w  ■  q  +  Mz 

where,  for  the  moment,  the  p-vector  q  end  the  p  x  p  matrix  M 
are  arbitrary.  Both  w  and  z  are  p-vectors. 

For  i  «  l,...,p  the  corresponding  variables  z^  and  v^  are 
called  complementary  and  each  is  the  complement  of  the  other.  A 
complementary  solution  of  (22)  is  a  pair  of  vectors  satisfying 

(22)  and 

(23)  z^  -  0  ,  i  -  1 . . 

Notice  that  a  solution  (w;z)  of  (1),  (2)  is  a  nonnegative 

complementary  solution  of  (22).  Finally,  a  solution  of  (22)  will 
be  called  almoat-complamantary  if  it  satisfies  (23)  except  for  one 
value  of  1,  say  i  ■  S.  That  is,  *g  i  0,  +  0. 

In  general,  the  procedure  assumes  as  aivan  an  extrema  point  of 
the  convex  aat 

2-  |s  j  v  •  q  +  Ms  ^  0,  s  >.  0 

which  also  happens  to  be  the  endpoint  of  an  almost-complmentary 
ray  (unbounded  edge)  of  Z.  Bach  point  of  this  ray  satisfies  (23) 
but  for  ona  value  of  1,  say  6.  It  is  not  always  easy  to  find  such 
a  starting  point  for  an  arbitrary  N.  Tat  there  are  two  Important 
realisations  of  the  fundamental  problem  which  can  be  so  initiated. 


The  first  is  the  bimatrix,  game  case  to  be  discussed  soon;  the  second  is 
the  case  where  an  entire  column  of  M  is  positive.  The  latter  property 
can  always  be  artif ically  induced  by  augmenting  M  with  an  additional 
positive  column;  as  we  shall  see,  this  turns  out  to  be  a  useful  device 
for  initiating  the  procedure  with  a  general  M. 

Each  Iteration  corresponds  to  motion  from  an  extreme  point 
along  an  edge  of  1  all  points  of  which  are  almost-complementary 
solutions  of  (22).  If  this  edge  is  bounded,  an  adjacent  extreme 
point  P^+^  is  reached  which  is  either  complementary  or  almost  - 
complementary.  The  process  terminates  if  (i)  the  edge  is  unbounded 
(a  ray),  (ii)  P^+^  is  a  previously  generated  extreme  point,  or 
(ill)  F.^„  is  a  complementary  extreme  point. 

Under  the  assumption  of  nondegeneracy,  the  extreme  points  of  Z 
are  in  one-to-one  correspondence  with  the  basic  feasible  solutions  of 

(22)  (See  [3]  ).  Still  under  this  assumption,  a  complementary  basic 
feasible  solution  is  one  in  which  the  complement  of  each  basic 
variable  is  nonbaslc.  The  goal  is  to  obtain  a  basic  feasible  solution 
with  such  a  property.  In  an  almost-complementary  basic  feasible  of 

(23) ,  there  will  be  exactly  one  index,  say  6  ,  such  •'hat  both 

v0  and  zfl  are  basic  variables.  Likewise,  there  will  be  exactly  one 


index,  eay  v  ,  such  that  both  w  and  t  are  nonbaslc  variables  . 

J  v  v 


C.  van  de  Panne  and  A.  Whinston  (21]  have  used  the  appropriate 
terms  basic  and  nonbaslc  pair  for  /wfl  ,  *Rl  and  /w  ,  z  \ 

respectively.  *  J  \  v  VJ 
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An  almost-complementary  edge  is  generated  by  holding  all  nonbaslc 


variables  at  value  zero  and  increasing  either  zv  or  w^  of  the 
nonbasic  pair  z^  ,  w^  .  There  are  consequently  exactly  two  almost- 
complementary  edges  associated  with  an  almost-complementary  ext  rente 
point  (corresponding  to  an  almost-complementary  basic  feasible 
solution) . 

Suppose  that  is  the  nonbasic  variable  to  be  increased.  The 

values  of  the  basic  variables  will  change  linearly  with  the  changes 
in  z^  .  For  sufficiently  snail  positive  values  of  z^,  the 
almost-complementary  solution  remains  feasible.  This  is  a  consequence 
of  the  nondegeneracy  assumption.  But  in  order  to  retain  feasibility, 
the  values  of  the  basic  variables  must  be  prevented  from  becoming 
negative. 

If  the  value  of  zy  can  be  made  arbitrarily  large  without 
forcing  any  basic  variable  to  become  negative,  then  a  rev  la  generated. 
In  this  event,  the  process  terminates.  However,  If  some  basic 
variable  blocks  the  increase  of  z^  (l.e.  vanishes  for  a  positive 
value  of  *y) ,  then  a  new  basic  solution  is  obtained  which  Is 
either  complementary  or  almost-complementary.  A  complementary 
solution  occurs  only  If  a  member  of  the  basic  pair  blocks  . 

A  new  almost-complementary  extrema  point  solution  is  obtained  If  the 
blocking  occurs  otherwise.  In  the  complementary  case,  i«e  have  the 
desired  result:  a  complementary  basic  feasible  solution.  In  the 
almost-complementary  case,  the  nondegeneracy  assumption  guarantees 
the  uniqueness  of  the  blocking  variable.  It  will  become  nonbasic  in 
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place  of  zy  and  Its  index  becomes  the  new  value  of  v  . 

The  complementary  rule.  The  complement  of  the  (now  nonbaslc)  blocking 
variable  —  or  equivalently  put,  the  ether  member  of  the  "new" 
nonbasic  pair  —  is  the  next  nonbasic  variable  to  be  Increased.  The 
procedure  consists  of  the  iteration  of  these  steps.  The  generated 
sequence  of  almost-complementary  extreme  points  and  edges  is  called 
an  alaost-complementarv  path. 

THEOREM  1.  Along  an  alisost-complementary  path,  the  only  almost- 
complementary  basic  feasible  solution  which  can  re-occur  is  the 
initial  one. 

PROOF:  We  assume  that  all  basic  feasible  solutions  of  (22)  are 
nondegenerate.  (This  can  be  assured  by  any  of  the  standard 
lexicographic  techniques  (3]  for  resolving  the  ambiguities  of 
degeneracy.)  Suppose,  contrary  to  the  assertion  of  the  theorem, 
that  the  procedure  generates  a  sequence  of  almost-cooplementary 
basic  feasible  solutions  in  which  a  term  other  than  the  first  one 
(Pq  in  the  figure  below)  is  repeated  (say  P^).  By  the  nondegeneracy 
assumption,  the  extreme  points  of  Z  are  in  one-one  correspondence 
with  basic  feasible  solutions  cf  (22).  Let  P^  denote  the  successor 
of  P^  end  let  denote  the  second  predecessor  to  P^  namely 

the  one  along  the  path  just  before  the  return  to  P^. 
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The  extreme  points  pc*P2,Pk  are  di8tinct  and  each  is  adjacent  to 
along  an  almost-complementary  edge.  But  there  are  only  two  such 
edges  at  P^.  This  contradiction  completes  the  proof. 

We  can  immediately  state  the 

COROLLARY.  If  the  almost-complementary  path  is  initiated  at  the 
endpoint  of  an  almost-complementary  ray,  the  procedure  must  terminate 
either  in  a  different  ray  or  a  complementary  basic  feasible  solution. 

It  is  easy  to  show  by  examples  thst  starting  from  an  almost- 
complementary  basic,  feasible  solution  which  is  not  the  endpoint  of 
an  almost-complementary  ray,  the  procedure  can  return  to  the  initial 
point  regardless  of  the  existence  or  non-existence  of  a  solution  to 
(1),  (2). 

EXAMPLE  1.  The  set  Z  associated  with 


1 1 

f  0 

0 

°\ 

-1 

M  - 

1 

0 

0 

3  J 

I'1 

-1 

-w 

is  nonempty  and  bounded.  It  is  clear  that  no  solution  of  (1)  can 
also  satisfy  (2)  sines  *^>0.  Let  the  extreme  point 
corresponding  to  the  solution  v  -  (1,0,0),  s  •  (1,0,2)  bs  the 
initial  point  of  a  path  which  begins  by  increasing  s^-  This  will 
return  to  the  Initial  cxtrssie  point  after  4  Iterations. 

EXAMPLE  2.  The  set  l  associated  with 
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Is  likewise  nonempty  and  bounded.  The  corresponding  fundamental 
problem  (1),  (2)  has  a  complementary  solution  w  *  (1,0, 1,0), 

z  ■  (0,1, 0,1).  Yet  by  starting  at  w  °  (1,2, 0,1),  z  ■  (3, 0,0,0)  and 
increasing  z^,  the  method  generates  a  path  which  returns  to  its 
starting  point  after  4  iterations. 

Furthermore,  even  if  the  procedure  is  initiated  from  an  extreme 
point  at  the  end  of  an  almost-complementary  ray,  termination  in  a  ray 
is  possible  whether  or  not  the  fundamental  problem  has  a  solution. 
EXAMPLE  3.  Giver,  the  data 


the  point  of  Z  corresponding  to  w  ■  (1,0, 4,1),  z  «  (1,0, 0,0)  is 
at  the  end  of  an  almost-complementary  ray,  w  *  (l.w^^  +  Wj,!), 
z  *  (1  +  W2»0,0,0).  Moving  along  the  edge  generated  by  increasing 
z 2  leads  to  a  new  almost-complementary  extreme  point  at  which  the 
required  increase  of  z^  is  unblocked,  so  that  the  process  terminates 
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in  a  ray,  and  yet  the  fundamental  probelm  is  solved  by 

w-  (2, 0,1,0),  2  -  (0,1, 0,1). 

EXAMPLE  4.  In  the  problem  with 


the  inequalities  (1)  have  solutions,  but  none  of  them  satisfy  (2). 

The  point  corresponding  to  (v;z)  ^  (1,0; 1,0)  is  at  the  end  of  an 
almost-compleraentary  ray  w  *  (l.w^),  z  ■  («£,()) .  When  z^  is 
Increased,  it  is  not  blocked,  and  the  process  terminates  in  a  ray. 
Consequences  of  termination  ...  ray.  In  this  geometrical  approach 
to  the  fundamental  problem,  it  ic.  useful  to  interpret  algebraically 
the  meaning  of  termination  in  an  almost-complementary  ray.  This  can 
be  achieved  by  use  of  a  standard  result  in  linear  inequality  theory 
[U],  (3]. 

*  * 

LEMMA,  If  (w  ;z  )  is  an  almost-complementary  basic  feasible 

*  * 

solution  of  (22),  and  (w  ;z  )  is  incident  to  an  almost-complementary 
ray,  there  exist  p~vectors  w^.z*1  such  that 

(24)  w*1  ■  Mz*1,  w*1  0,  zh  >,  0,  z*1  j*  0 

and  points  along  the  almost-complementary  ray  are  of  ihe  form 

(25)  (w  +  Aw*1  ,  z*  +  Az*1)  l  >,  0 
and  satisfy 
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(26) 


(w^  +  Aw^) (z  +  Az^')  ■  0  for  all  A  _>.  0,  and  all  i  f*  8 

THEOREM  2.  If  M  >  0,  (22)  has  a  complementary  basic  feasible 

solution  for  any  vector  q. 

PROOF.  Select  w^,...,  as  the  basic  variables  in  (22).  He  may 
assume  that  q  £  0  for  otherwise  (w;z)  ■  (q ; 0)  immediately  solves 
the  problem.  A  starting  ray  of  feasible  almost-complementary 
solutions  is  generated  by  taking  a  sufficiently  large  value  of 
any  nonbasic  variable,  say  z^.  Reduce  z^  toward  zero  until  it 
reaches  a  value  z°  >_  0  at  which  a  unique  basic  variable  (assuming 
non-degeneracy)  becomes  zero.  An  extreme  point  has  then  been 

V  * 

reached . 

The  procedure  has  been  initiated  in  tb“  manner  described  by  the 

corollary  above,  and  consequently  the  procedure  must  terminate  either 

in  a  complementary  basic  feasible  solution  or  in  an  aJmost- 

complementary  ray  after  some  basic  feasible  solution  (w;z*)  is 

reached.  We  now  show  that  the  latter  cannot  happen.  For  if  it  does, 

conditions  (24)  -  (26)  of  th'>  lemma  obtain  wi^h  6  ■  1.  Since 

M  >  0  and  z^  ^  0,  this  implies  w*'  >  0.  Hence  by  (26), 

*  h 

z^  »  z^  ■  0  for  all  i  p  1.  Hence  the  only  variables  which  change 
with  A  are  z^  and  the  components  of  w.  Therefore  the  final 
generated  ray  is  the  same  as  the  initiating  ray,  which  contradicts  the 
corollary. 

THEOREM  3.  A  bimatrix  game  T(A,B)  has  an  extreme  equilibrium  point. 


/ 
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PROOF.  Initiate  the  algorithm  by  choosing  the  smallest  positive 
value  of  x^,  say  x°,  such  that 


(27) 


-  e  +  B?x°  >  0 
n  11  — 


T  T 

where  B^  is  the  first  column  of  B  .  With 


-  e  +  B?x° 
n  11 


it  follows  (assuming  nondegeneracy)  that  v°  has  „xactly  one  zero 
component,  say  the  r-th.  The  ray  is  generated  by  choosing  as  basic 
variables  x^  and  all  the  slack  variables  u,v  except,  for  vr< 

The  complement  of  vf,  namely  yf,  is  chosen  as  the  nonbaslc 
variable  to  increase  indefinitely.  For  sufficiently  large  values  of 
yr,  the  basic  variables  are  all  nonnegative  and  the  ray  so  generated 
is  complementary  except  possibly  x^u^  might  not  equal  0.  Letting 
yr  decrease  toward  zero,  the  initial  extreme  point  is  obtained  for 
some  positive  value  of  y^. 

If  the  procedure  does  not  terminate  in  an  equilibrium  point,  then 

by  the  corollary,  it  terminates  in  an  almost-complementary  ray.  The 

latter  implies  the  existence  of  a  class  of  almost-complementary 

* 

solutions  of  the  form 


(28) 


*  ,  hi 

u  +  Au 

/  -e 

'  0 

A1 

/x  +  Ax^\ 

,  v*  +  *vh , 

\  n 

+ 

°l 

\y*  +  *yhJ 

The  notatlonal  analogy  with  the  previously  studied  case  M  >  0  is 
obvious . 
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(29) 


all  i  +  1 N 


(30) 


(u*  +  Xu^)(xA  +  Xx^)  -  0 

*  h  *  h 
(Vj  +  XvJ)  (yj  +  Xyp  -  0 


all  j 


all  X  >  0 


J 

Assume  first  that  x*1  +  0.  Then  v*1  ■  B^x*1  >  0.  By  (30), 

*  h  *h 

y,  +  Xy,  -  0  for  all  j  and  all  X  >  0.  But  then  u  +  Xu  ■  -e  <  0, 
7j  J  -  m 

h  h 

a  contradiction.  Assume  next  that  y  i  0  and  x  >0.  Then 

u*1  •  Ay*1  >  0.  By  (29),  x^  »  0  for  all  i  i  1;  and  x^  »  0  for 
h  T  h  * 

all  i.  Hence  v  *  B  x  *  0  and  v  is  the  same  as  v  defined  by 

(27)  since  x^  must  be  at  the  smallest  value  in  order  that 
*  *  *  * 

(u  ,v  ,x  ,y  )  be  an  extreme-point  solution.  By  the  nondegeneracy 

A 

assumption,  only  -  0,  and  >  0  for  all  j  i4  r.  Hence  (30) 

*  h 

implies  y^  +  Xy^  •  0  for  all  j  f  r.  It  is  now  clear  that  the 
postulated  terminating  ray  is  the  original  ray.  This  furnishes  the 
desired  contradiction.  The  algorithm  must  terminate  in  an  equilibrium 
point  of  the  bimatrix  game  r(A,B). 

A  modification  of  almost-complementarv  basic  sets.  Consider  the 
system  of  equations 


(31)  w  ■  q  +  a  z  +  Mz 

P  0 

where  z^  represents  an  "artifical  variable"  and  e^  is  a  p-vector 
(1,...,1).  It  is  clear  that  (31)  always  has  nonnegative  solutions. 
A  solution  of  (31)  is  called  almost-complementarv  if 
*iWi  "  ®  ^or  *  “  is  complementary  if,  in  addition, 

iq  •  0.  (See  [16,  p.  685]  where  a  different  but  equivalent 
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definition  is  given.)  In  this  case,  let 

Zo  ”  {(Vz)  i  w  "  «  +  Vo  +  Mz  i  °‘  Zo  -  z  -  0  ) 

We  consider  the  almost-complementary  ray  generated  by  sufficiently 
large  zq.  The  variables  are  initially  basic  while 

zq,z^ . zp  are  nonbasic  variables.  For  a  sufficiently  large  value 

,  + 
of  zq,  say  zq, 

w+  ■  q  +  e  zr  >  0 
p  o 

As  zq  decreases  toward  zero,  the  basic  variables  w^  decrease.  An 

initial  extreme  point  is  reached  when  zq  attains  the  minimum  value 

z°  for  which  w  ■  q  +  e  z  >0.  If  z°  ■  0,  then  q  >  0;  this 
o  p  o  ~  o  — 

is  the  trivial  case  for  which  no  algorithm  is  required.  If 

z°  >  0,  some  unique  basic  variable,  say  w r  has  reached  its  lover 

bound  0.  Then  z  becomes  a  basic  variable  in  place  of  w  and 
o  r 

we  have  v  -  r.  Next,  zr,  the  complement  of  wf,  is  to  be 
increased. 

The  remaining  a taps  of  the  procedure  are  now  identical  to  thosa 

in  tha  preceding  algorithm.  After  a  blocking  variable  becomes  basic, 

its  complement  is  increased  until  either  a  basic  variable  blocks 

the  increase  (by  attaining  its  lower  bound  0)  or  else  an  alaoet- 

complamentary  ray  is  generated.  There  are  precisely  two  forms  of 

termination.  One  is  in  a  ray  as  just  described;  the  other  la  in  the 

reduction  of  zA  to  the  value  0  and  hence  tha  attainment  of  a 
o 


complementary  basic  feasible  eolution  of  (31),  i.e.  a  solution  of 


(1),  (2). 


Interest  now  centers  on  the  meaning  of  termination  in  an  almost- 

compleraentary  ray  r >lution  of  (31) .  For  certain  classes  of  matrices. 

the  proceba  described  above  terminates  in  an  almost-complementary 

ray  if  and  only  if  the  original  system  (1)  has  no  solution.  In  the 

remainder  of  this  section,  we  shall  amplify  the  preceding  statement. 

If  termination  in  an  almost-complementary  ray  occurs  after  the 

*  *  * 

process  reaches  a  basic  feasible  solution  (w  ;zq,z  )  corresponding 

to  an  extreme  point  of  Zq,  then  there  exists  a  nonzero  vector 

(w^jz^.z*1)  such  that 
o 

/.MV  h  h  .  „  h  ,  h  h  h.  « 

(32)  w  *ez  +  M z  ,  (w  ;z  , z  )  >  0 

p  o  o  — 

Moreover  for  every  i  >  0  , 

(33) 

*  h  *  h  *  h 

(w  +  Aw  )  -  q  +  Cp(*0  +  Xz^)  +  M(z  +  Xz  ) 

and 

(34)  (w*  +  Xz^)(z*  +  Xz^)  “0  i  -  1 . .  . 

The  case  z*1  »  0  is  ruled  out,  for  otherwise  z*1  >  0  and  then 

o 

wh  >  0  because  (wh;z^,zh)  i  0.  Now  if  wh  >  0,  (34)  implies 

*  h  * 

z  +  Xz  ■  z  ■  0.  This,  in  turn,  implies  that  the  ray  is  the 
original  one  which  is  not  possible. 

Furthermore,  it  follows  from  the  almost-complementarity  of 
solutions  along  the  i,ty  that 
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(35) 


i  *  1#  •  •  •  pp  • 


*  *  *  h  h  *  hh  n 

ziw  i  "  ziwi  “  ziwi  “  ziwi  “  0 


The  individual  equations  of  the  system  (32)  are  of  the  form 


(36) 


h  h  .  .  h » 

ui  ■  2o  +  (Mz  >1 


i  ■  P 


Multiplication  of  (36)  by  z^  leads,  via  (35),  to 


(37) 


0  -  zjz*1  +  z^(Mzh) . 
1  o  i  i 


i  -  l,...,p 


from  which  we  conclude 

THEOREM  4:  Termination  in  a  ray  implies  there  exists  a  nonzero 
nonnegative  vector  z1  such  that 


(38) 


z^(Mzh)t  <.  0 


i  ■  1,  •  •  •  »P 


At  this  juncture,  two  large  classes  of  matrices  M  will  be 
considered.  For  the  first  class,  we  show  that  termination  in  a  ray 
implies  the  inconsistency  of  the  system  (1).  For  the  second  i lass, 
we  will  show  that  termination  in  a  ray  cannot  occur,  so  that  for  this 
class  of  matrices,  (1),  (2)  always  has  a  solution  regardless  of 

what  q  is . 

The  first  class  mentioned  above  was  Introduced  by  LesUte  [16]. 
These  matrices,  which  we  shall  refer  to  as  copositive  plus,  are 
required  to  satisfy  the  two  conditions. 


(39) 


uMu  >  0  for  all  u  >  0 
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i,  ► 


(40)  (M  +  M^)u  *0  if  uMu  ■  0  and  u  _>  0 

Matrices  satisfying  conditions  (39)  alone  art.  known  in  the 
literature  as  copositive  (see  [18],  [12].)  To  our  knowledge,  there 

is  no  reference  other  than  l16l  on  copositive  matrices 
satisfying  the  condition  (40).  However,  the  class  of  such 
matrices  is  large  and  includes 

(i)  all  strictly  copositive  matrices,  i.e.  those 
for  which  uMu  0  when  0  +  u  >_  0 
(li)  all  positive  semi-definite  matrices,  i.e.  those 
for  which  uMu  _>  0  for  all  u. 

Positive  matrices  are  obviously  strictly  copositive  while  positive 
definite  matrices  are  both  positive  semi-definite  and  strictly 
copositive.  Furthermore,  it  is  possible  to  "build"  matrices 
satisfying  (39)  and  (40)  out  of  smaller  ones.  For  example,  if 
and  M^  are  matrices  satisfying  (39)  and  (40)  then  so  is  the 
block-diagonal  matrix 


Moreover,  if  M  satisfies  (39)  and  (40)  and  S  is  any  skew- 
symmetric  matrix  (of  its  order),  then  M  +  S  satisfies  (39)  and 
(40).  Consequently,  block  matrices  such  as 
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\  A  M2/ 


satisfy  (39)  and  (40)  if  and  only  if  and  do  too.  However, 

as  Lemke  [16],  [17]  has  pointed  out,  the  matrices  encoun. “red  in  the 
bimatrix  game  problem  with  A  >  0  and  B  >  0  need  not  satisfy  (40). 
The  Lemke-Howson  iterative  procedure  for  bimatrix  games  was  given 
earlier  in  this  section.  If  applied  to  bimatrix  games,  the 
modification  Just  given  always  terminates  in  a  ray  after  just  one 
iteration,  as  can  be  verified  by  taking  any  example. 

The  second  class,  consisting  of  matrices  having  positive  principal 
minors,  has  been  studied  by  numerous  investigators;  see  for  example, 
[2],  [4],  [8],  [9],  [10],  [22],  [24].  In  the  case  of 

symmetric  matrices,  those  with  positive  principal  minors  are  positive 
definite.  But  the  equivalence  breaks  down  in  the  non-symmetrlc 
situation.  Nonsymmetric  matrices  with  positive  principal  minors  need 
not  be  positive  definite.  For  example,  the  matrix 

2  -7 

-1  4 

has  positive  principal  minors  but  is  Indefinite  and  not  copositive. 
However,  positive  definite  matrices  are  a  subset  of  those  with 
positive  principal  minora.  (Sec,  e.g.  (2).) 
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We  shall  sake  use  of  the  fact  that  w  ■  q  +  Mz,  (w;z)  _>  0  has 
no  solution  If  there  exists  a  vector  v  such  that 

(41)  vM  <_  0,  vq  <  0,  v  ^  0 

for  otherwise,  0  vw  »*  vq  +  vMz  <  0,  a  contradiction.  Indeed,  it 
is  a  consequence  of  J.  Far  leas'  theorem  [7]  that  (1)  has  no 
solution  if  and  only  if  there  exists  a  solution  of  (41) . 

THEOREM  5.  Let  M  be  copositive  plus.  If  the  iterative  procedure 
terminates  in  a  ray,  then  (1)  has  no  solution. 

PROOF.  Termination  in  a  ray  means  that  a  basic  feasible  solution 

*  *  * 

(v  ;z  ,z  )  will  be  reached  at  which  conditions  (32)  -  (34) 
o 

hold  and  also 

(42)  0  -  zhwh  -  zhepz£  +  zNz*1 

Since  M  is  copositive  and  z*1  _>  0,  both  terms  on  the  right  side  of 
(42)  are  nonnegative,  hence  both  are  zero.  The  scalar  z^  ■  0 
because  z**ep  >  0.  The  vanishing  of  the  quadratic  form  z**Mzh  means 

„  h  ,  UT  h  _ 

Mz  +  M  x  -  0 

But  by  (32),  z^  •  0  implies  that  w*1  »  Mz^  ^  0,  whence  M*z^  0 
or,  what  is  the  sea.e  thing,  z**M  0.  Next,  by  (35), 

n  *  h  •  h  »,  T  h.  (l  * 

0  -  z  w  -  z  Mz  -  z  (-M  z  )  •  -z  MX 

and  we  obtain  again  by  (35) 
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.  h  *  h  .  h  *  ,  K.  *  h  ,  h  * 

0  *  z  w  -  zq  +  zez  +  z  rlz  ■  z  q  +  z  e  z 

p  0  ’  p  o 

h  h  * 

It  follows  that  z  q  <  0  because  z  e  z  >0.  The  conditions  (1) 
are  therefore  inconsistent  because  v  ■  z*1  satisfies  (41). 
COROLLARY.  If  M  is  strictly  copositive,  the  process  terminates 
in  a  complementary  basic  feasible  solution  of  (31). 

PROOF.  If  not,  the  proof  of  theorem  5  would  imply  the  existence  of  a 
vector  z*1  satisfying  z*^Mzh  -  0,  0  i  zh  >.  0  which  contradicts 

the  strict  copositivity  of  M. 

This  corollary  clearly  generalizes  Theorem  1.  We  now  turn  tc 
the  matrices  M  having  positive  principal  minors. 

THEOREM  6.  If  M  has  positive  principal  minors,  the  process 
terminates  in  a  complementary  basic  solution  of  (31)  for  any  q. 
PROOF.  We  have  seen  that  termination  in  a  ray  inplies  the  existence 
of  a  nonzero  vector  zh  satisfying  the  inequalities  (38).  However, 
Gale  and  Nikaldo  (10  ,  Theorem  2]  have  shown  that  matrices  with 
positive  principal  minors  are  characterized  by  the  impossibility  of 
this  event.  Hence  termination  it.  a  ray  is  not  a  possible  outcome 
for  problems  in  which  M  has  positive  principal  minors. 

We  can  even  improve  upon  this. 

THEOREM  7.  If  M  has  the  property  that  for  each  of  its  principal 
submatrices  M,  the  system 

Mr  ^  0,  0  i  i  ^  0 

has  no  solution,  then  the  process  terminates  in  a  complementary 
basic  solution  cf  (31)  for  any  q. 
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PROOF.  Suppose  the  process  terminates  in  a  ray.  From  the  solution 

Jj 

(w;z”,z  )  of  the  homogeneous  system  (32) ,  define  the  vector 

fc*1  of  components  of  w*1  for  which  the  corresponding  component 

*  h  h  h 

of  z  +  z  is  cositive.  Then  by  (34)  *  ■  0.  Let  2  be  the 

vector  of  corresponding  components  in  z*1.  Clearly  0  ^  2^  j>  0, 

since  i  j*  z*1  0  and  any  positive  component  of  z*1  is  a  positive 

component  of  z1  by  Jefi  ition  of  wh.  Let  M  be  the  corresponding 

principal  submatrix  of  M.  Since  M  is  a  matrix  of  order  L  ^  1 

we  may  write 

„  .h  h  .  - ..  h 
0  ■  w  *  e.  z  +  Mz 
k  o 

Hence 

M2h  <_  0,  0  i  0 

which  is  a  contradiction. 

3.  The  principal  pivoting  method.  We  shall  now  describe  an 
algorithm  proposed  by  the  authors  [4]  which  predates  that  of  Lemke. 
It  evolved  from  a  quadratic  programming  algorithm  of  P.  Wolfe  [26] 
who  was  the  first  to  use  a  type  of  complementary  rule  for  pivot 
choice.  Our  method  is  applicable  to  matrices  K  that  have  positive 
principal  minors  (i^  particular  to  positive  definite  matrices) 
and  after  a  minor  modification,  to  positive  semi-definite  matrices. 

In  Lemke's  procedure  for  general  M,  an  artifical  variable  zq 
is  Introduced  in  ofder  to  obtain  feasible  almost-complementary 
solutions  for  the  augmented  problem.  In  our  approach,  only  variables 
of  the  original  problem  are  used,  but  these  can  take  on  initially 
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negative  as  well  as  non-negative  values. 


A  major  cycle  of  the  algorithm  is  initiated  with  the  complementary 
basic  solution  (w;z)  *  (q ; 0) .  If  q  0,  the  procedure  is  Immediately 
terminated.  If  q  0,  we  may  assume  (relabeling  if  necessary) 
that  w^  *  <  0.  An  almost-complemencary  path  is  generated  by 

increasing  z^,  the  complement  of  the  selected  negative  basic 
variable.  For  points  along  the  path,  z^w^  “  ®  for  i  f*  1. 

Step  I.  Increase  z^  until  it  is  blocked  by  a  positive  basic 
variable  decreasing  to  zero  or  by  the  negative  increasing  to 

zero. 

Step  II.  Make  the  blocking  variable  nonbasic  by  pivoting  its 
complement  into  the  basic  j;et.  The  major  cycle  is  terminated  if 
w^  drops  out  of  the  basic  set  of  variables.  Otherwise,  return 
to  Step  I. 

It  will  be  rhown  that  during  a  major  cycle  w^  increases  to 
zero.  At  this  point,  a  new  complementary  basic  solution  is  obtained. 
However,  the  number  of  basic  variables  with  negative  values  is  at 
least  one  less  tii  •.«  at  the  beginning  of  the  major  cycle.  Since  there 
are  at  most  p  negative  basic  variables,  no  more  than  p  major 
cycles  are  required  to  obtain  a  complementary  feasible  solution  of 
(22).  The  proof  depends  on  certain  properties  of  matrices  invariant 
under  principal  pivoting. 

Principal  pivot  transform  of  a  matrix.  Consider  the  homogeneous 
system  v  **  Mu  where  M  is  a  square  matrix.  Here  the  variables 
V...,Vp  are  basic  and  expressed  in  terms  of  the  nonbssic  variables 
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. . u  .  Let  any  subset  of  the  v.  be  made  nonbasic  and  the 

I  p  i 

corresponding  basic.  Relable  the  full  set  of  basic  variables 
v  and  the  corresponding  nonbasic  variables  u.  Let  v  ■  Mu 
express  the  new  basic  variables  v  in  terms  of  the  nonbasic  ones. 

The  matrix  M  is  called  a  principal  pivot  transform  of  M.  Of 
course,  this  transformation  can  be  carried  out  only  if  the  principal 
submatrix  of  M  corresponding  tc  the  set  of  variables  and 

interchanged  is  nonsingular,  and  this  will  be  assumed  whenever 
the  term  is  used. 

THEOREM  8.  (Tucker  [24]).  If  a  square  matrix  M  has  positive 
principal  minors,  so  does  every  principal  pivot  transform  of  M. 

The  proof  of  this  theorem  is  easily  obtained  inductively  by 
exchanging  the  roles  of  one  complementary  pair  and  evaluating  the 
resulting  principal  minors  in  terms  of  those  of  M. 

THEOREM  9.  If  a  matrix  M  is  positive  definite  or  positive  semi- 
definite  so  is  every  principal  pivot  transform  of  M. 

PROOF.  The  original  proof  given  by  the  authors  was  along  the 
lines  of  that  for  the  preceding  theorem.  P.  Wolfe  has  suggested  the 
following  elegant  proof.  Consider  v  -  Mu.  After  the  principal 
pivot  transformation,  let  v  »  Mu,  where  u  is  the  new  set  of 
nonbasic  variables-  We  wish  to  show  that  uMu  -  uv  >  0  if 
uMu  ■  uv  >  0.  If  M  is  positive  definite,  the  latter  is  true  if 
u  i  0,  and  the  former  must  hold  because  every  pair  is 

identical  with  (u^.v^)  except  possibly  in  reverse  order.  Hence 
yG^v^  ■  £ujV^  >  0.  The  proof  in  the  semi-definite  case  replaces  the 
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inequality  >  by  _>  . 

Validity  of  the  algorithm.  The  proof  given  below  for  p  ■  3  goea 
through  for  general  p.  Consider 


,1  +  m11i1  +  mi2z2  +  "13*3 
"2  "  *21zl  +  m22z2  +  ®23z3 


’3  *■  m31*l 


Suppose  that  M  has  positive  principal  minors  so  that  the  diagonal 
coefficients  are  all  positive: 

"U  ’  °*  "22  ‘  °*  "33  '  0 

Suppose  furthermore  that  some  q^  is  negative,  say  q^  <  0.  Then 
the  solution  (w;z)  -  (q^q^q^jOjOjO)  is  complementary,  but  not 
feasible  because  a  particular  variable,  in  this  case  w^,  which 
we  refer  to  as  distinguished  is  negative.  We  now  initiate  an  almost* 
complementary  path  by  Increasing  the  complement  of  the  distinguished 
variable,  in  this  case  z^,  which  we  call  the  driving  variable. 
Adjusting  the  basic  variables,  we  have 

(w;.)1  -  <,x  +  .u.v  ,2  ♦  A,  ♦  *32*i  *0*0*0) 

Note  that  the  distinguished  variable  w^  increases  strictly  with 
the  Increase  of  the  driving  variable  z^  because  a^  >  0. 

Assuming  nondegeneracy,  we  can  increase  z^  by  a  positive  amount 
before  it  is  blocked  either  by  reaching  zero  or  by  s  basic 
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variable  that  waa  positive  and  ia  nov  turning  negative. 

* 

In  the  former  case,  for  some  positive  value  z^  of  the 

* 

driving  variable  i^t  we  have  wi  "  qi  +  mll*i  "  T^ie  •°lution 

2  *  * 

(w;z)  -  (0,q2  +  m^z^  <13  +  ®31z1»  0*°»°) 

ia  complementary  and  has  one  less  negative  component.  Pivoting  on 
m^,  replaces  w^  by  z^  as  a  basic  variable.  By  Theorem  8,  the 
matrix  M  in  the  new  canonical  system  relabeled  w  ■  q  +  Mz 
has  positive  principal  minors,  allowing  the  entire  major  cycle  to  be 
repeated. 

In  the  latter  case,  we  have  some  other  basic  variable,  say 

A 

w2  *  qj  +  n2izi  blocking  when  z^  *  z^  >  0.  Then  clearly 
m21  <  0  and  q^  >0.  In  this  case, 

<w;z)2  -  <mnz*  +  qr0,  n^zj  q3;z*,0,0) 

THEOREM  10.  If  the  driving  variable  is  blocked  by  a  basic  variable 
other  than  its  complement,  a  principal  pivot  exchanging  the 
blocking  variable  with  its  complement  will  permit  the  further  increase 
of  the  driving  variable. 

PROOF:  Pivoting  on  m^  generates  the  c tnonlcal  system 

*1  •  V  *11*1  ’  *12w2  *  *13*3 

*2  ’2  *  *22*1  *  *22,2  *  *23*3 

v3  *  V  *31*1  *  *32*2  +  *33*3 
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2 

The  eolution  (w;z)  must  satisfy  the  above  since  it  is  an 

* 

equivalent  system.  Therefore  setting  z^  ■  z^,w2  *  ^,  *3*0 

yields 

2  _  *  _  *  * 

(w;z)  •  (qx  +  0,  q3  +  m^z^  ^.0*0) 

i.e.,  the  same  almost-complementary  solution.  Increasing  z^ 

* 

beyond  z^  yields 


(qx  +  “nV  °»  53  +  m31z1;z1,0,0) 

which  is  also  almost-complementary.  The  sign  of  is  the 

reverse  of  m21,  since  m21  •  -®2l/,B22  >  °*  Hence  *2 

ft 

with  increasing  z^  >  z^  ;  i.e.,  the  new  basic  variable  replacing 
w2  is  not  blocking.  Since  M  has  positive  principal  minors, 

®n  >  0-  H*nce  w:  cqnUavfT  jo  kMMU  £*&  *x  >  **  • 

THEOREM  11.  The  number  of  iterations  within  a  major  cycle  is  finite. 

PROOF:  There  are  only  finitely  many  possible  bases.  Ho  basis 

can  be  repeated  with  a  larger  value  of  s^.  To  see  this,  suppose  it 
**  * 

did  for  s^  >  fx  *  This  would  imply  that  some  component  of  the 

* 

solution  turns  negative  at  s^  *  s^  and  yet  is  nonnegative  when 
** 

^  ■  *1  .  Since  the  value  of  a  component  is  linear  in  s^  we  have 
a  contradiction. 

EttftBhries  ot  the  principal  pivoting  method.  Along  the  almost- 
complementary  path  there  is  only  one  degree  of  freedom.  In  the  proof 
of  the  validity  of  the  algorithm,  s^  was  increasing  and  s2  was 
shown  to  Increase.  The  same  class  of  solutions  can  be  generated 
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by  regarding  s.  as  the  driving  variable  and  the  other  variables 
as  adjusting.  Hence  within  each  major  cycle,  the  same  almost- 
complementary  path  can  be  generated  as  follows.  The  first  edge 
is  obtained  by  using  the  complement  of  the  distinguished  variable 
as  the  driving  variable.  As  soon  as  the  driving  variable  is 
blocked,  the  following  steps  are  iterated: 

a)  replace  the  blocking  variable  by  the  driving  variable  and 
terminate  the  major  cycle  if  the  blocking  variable  is 
distinguished;  if  the  blocking  variable  is  not 
distinguished. 

b)  let  the  complement  of  the  blocking  variable  be  the  new 
driving  variable  and  Increase  it  until  a  new  blocking 
variable  is  identified;  return  to  a). 

The  paraphrase  form  is  used  in  practice. 

THEOREM  12.  The  principal  pivoting  method  terminates  in  a  solution 
of  (1),  (2)  if  M  has  positive  principal  minors  (and,  in 

particular,  if  M  is  positive  definite). 

PROOF.  We  have  shown  that  the  completion  of  a  major  cycle  occurs 
in  a  finite  number  of  steps,  and  each  one  reduces  the  total  number  of 
variables  with  negative  values.  Hence  in  a  finite  number  of  steps, 
this  total  is  reduced  to  zero  and  a  solution  of  the  fundamental 
problem  (1),  (2)  is  obtained.  Since  a  positive  definite  matrix 

has  positive  principal  minors,  the  method  applies  to  such  matrices. 

As  indicated  earlier,  the  positive  semidefinite  case  can  be 
handled  by  using  the  paraphrase  form  of  the  algorithm  with  a  minor 
modification.  The  reader  will  find  details  in  [4]. 
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THE  PRINCIPAL  PIVOTTNG  METHOD  OF  QUADRATIC  FROORAHOBO 


By 

Richard  W.  Cottle 
Stanford  University 


I.  BACKGROUND 

Quadratic  programing  is  concerned  with  the  study  of  optimization  problems 

# 

which  can  be  posed  in  the  form 

(1)  minimize  Q(x)  ■  c^x  +  -^-x^Bx 

subject  to  Ax  >  b 

x  >  0 

Die  points  (vectors)  satisfying  the  side  conditions  or  constraints  of  the 
problem  (l)  are  sei<i  to  be  feasible  solutions  and  collectively  they  fora  the 
constraint  set 

(2)  (f  -  {xfcR*  |  Ax  >  b  ,  x  >  0  } 

which  could,  of  course,  be  empty,  in  which  case  the  problem  (l)  is  said  to  be 

infeasible.  But  empty  or  not,  'C  is  always  a  polyhedral  convex  set.  Die 

convexity  of  the  objective  function  Q(x)  is  quite  another  matter.  It  is 

well  known  that  a  quadratic  function  Q  is  convex  on  R8  if  and  only  if  its 
1  T 

"quadratic  part"  -^-x  Dx  is  a  positive  semi -definite  fora.  If  the  dimension 
of  C  is  less  than  n,  then  Q  could  bo  convex  on  X  without  being  convex  on  Rn. 
A  discussion  of  this  possibility  can  be  found  in  [6).  When  Q  is  convex  on  Rn, 


*  All  numerical  quantities  of  tiiim  problem  sure  understood  to  be  real  numbers. 
Die  vector  x  represents  an  n-tuple  of  variables  whose  values  are  to  be  deter¬ 
mined.  Die  matrix  A  is  assumed  to  be  of  order  m  by  n,  and  without  loss  of 
generality,  the  n -square  matrix  D  may  be  regarded  as  symmetric.  Die  superscript 
T  denotes  transposition.  Vector  Inequalities  are  equivalent  to  componentwise 
inequalities  of  the  same  type. 
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(1)  is  called  the  eoavex  quadratic  programing  problem.  It  is  a  genuine  exten¬ 
sion  of  the  linear  programing  problem  which  corresponds  to  the  case  in  which 
C  is  the  n ^square  zero  matrix. 

As  Implied  above,  our  study  stems  from  problems  of  the  form  (1).  'This 
is  not  restrictive,  in  the  class  of  linearly-constrained  quadratic  minimization 
problems.  For  instance,  there  is  a  simple  technique  for  converting  problems 
such  as 

(!’)  minimize  Q(x)  =  cTx  +  -jpX^Dx 

subject  to  Ax  =  b 

into  the  inequality -constrained  format  (l)  without  having  to  double  the  number 
of  constraints  and  variables.  This  is  treated  in  the  Appendix. 

In  (l),  we  seek  a  global  minimum  of  Q(x)  subject  to  x£  ^  ,  that  is,  ar. 
ic  €  &  satisfying  q(x)  <  Q(x)  for  all  x  €  C  .  Such  an  x  is  said  to  be  an 
optimal  solution  of  the  problem.  In  the  usage  adopted  here,  no  vector  can  be 
optimal  if  it  is  not  also  feasible. 

The  necessary  conditions  of  optimality  for  problem  (l)— found  by  applying 
the  celebrated  theorem  of  Kuhn  and  IXicker  [19  ]— state  that  if  x  is  an  optician 
solution  to  the  problem  (1),  there  exists  a  vector  y  bu  :h  that 
(3)  c  +  Dbc  -  A^y  >  0 

-b  +  Ax  >0 

x  >  0 
y  >  0 

x^[c  +  Dx  -  A^y]  «  0 

-T.  — 

y  [  -b  +  Ax  ]  »  0 

Morale**.  thf.ie  cond  ;tions  a^e  sufficient  when  (l)  is  a  convex  quadratic  program. 
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The  duality  theory  for  quadratic  programing  completely  embraces  that  for 
linear  programming.  Thus,  when  D  ie  positive  semi -definite,  the  primal 
problem  (l)  has  the  dual 

(1*)  maximize  F(x,y)  *  b  y  -  -g-x  Dx 

T 

subject  to  c+Dx-Ay>0 

(x>  0) 
y  >  0 

The  duality  of  the  pair  (l),(4)  was  first  discovered  by  Dorn  [12 J  and  Dennis  [11]. 
Later,  in  [3],  the  author  symmetrized  the  duality  theory  by  amending  the  primal 
problem  to  read 

(1 ' )  minimize  Q(x,y)  =  c-^x  +  -iy-y^Ey 

subject  to  -b  +  Ax  +  gy  >  0 

X  >  0 

(y  >  o) 

where  E,  like  D,  is  symmetric  and  positive  semi -definite.  The  Kiihn-Thcker 

conditions  for  (l ' )  are 

(3’)  c  +  'Dx  -  A^y  >  0 

-b  +  Ax  +  Ey  >0 
x  >  0 
y  >  0 

->V  —  Urn 

x  [c  +  Dx  -  A  y]  =  0 

y^[-b  +  Ax  +  Ey ]  *  0 


and  the  dual  of  (1  * )  is 
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(4')  maximize  P(x,y)  *  bTy  -  -|-xTDx  -  -|-yTEy 

T 

subject  to  c+Dx-Ay>0 

(x  >  0) 
y  >  0 

The  parentheses  around  sign-restricted  variables  indicate  that  these  restrid 
can  be  imposed  without  loss  of  generality  even  though  they  are  not  required  1 
the  validity  of  the  duality  theorems. 

For  any  solution  (x,y )  of  the  orthogonality  (or  "complementary  slackness") 
conditions 

x^[c  +  Ex  -  ATy  ]  =  0 
yT[-b  +  Ax  +  Ey]  =  0 

it  follows  that 

«  T  IT  IT 

Q(x,y)  =  c  x  +  -5-xVx  +  rjy-y  Ey 

=  4"^cTx  +  feTy^ 

*  bTy  -  «^-xTDx  -  -*-yTiy 

=  P(x,y) 

Therefore  the  value  of  either  quadratic  objective  function  P  or  Q  at  any  solut. 

IT  T 

of  (.5’)  is  readily  calculated  by  evaluating  the  Linear  function  -^~(c  x  +  b  y) 
As  might  be  imagined,  the  system  (31)  plays  a  central  role  in  the  soluti 
of  (!’)  and  (4’),  a  fact  stressed  by  Wolfe  [24].  In  order  to  simpJify  the 
manipulation  of  (}'),  it  has  been  found  advantageous  to  represent  the  block 
matrix 

f°  -aT) 

\A  E/ 


-  5  - 


by  the  single  letter  M  and  the  vectors  *  (yj  ^  z>  respectively. 

With  these  identifications,  (3')  takes  the  simpler  form 
(5)  q  +  Mz  >  0 

z  >  0 
zT[q  +  Mz]  -  0 

We  shall  call  this  the  fundamental  problem.  It  has  been  recognized,  however, 
that  systems  of  the  form  (5)  can  often  be  solved  without  relying  on  the  special 
structure  in  the  Identification  above.  (See  Dantzig  and  Cottle  [8]  and  Leake  [20]. ) 
The  structural  assumptions  can  be  replaced  by  more  general  properties  of  the 
matrix  M,  such  as 

(i)  positivity  of  all  principal  minors 
(ii)  positive  semi -definiteness 

or  generalizations  thereof.  One  such  generalization  is  treated  by  Lemke  [20], 
another  by  Ingleton  [18]  and  the  author  [7]. 

The  study  of  the  fundamental  problem  (5)  has  been  approached  in  two  ways: 
one  existential,  the  other  constructive.  As  the  names  suggest,  the  existential 
approach  is  concerned  with  conditions  which  imply  the  existence-— and  in  same 
cases,  the  uniqueness— of  solutions  to  thesystem,  whereas  the  constructive 
approach  concentrates  on  the  development  of  efficient  computational  procedures. 

The  two  approaches  are  not  completely  disjoint,  however.  For  example,  the 
principal  pivoting  method  described  below  gmsvers  the  existence  question  when 
the  data  M  and  q  art  specified  and  the  matrix  N  belong*  to  an  allowable  Class 
of  matrices.  It  Is  also  true  that  existential  studies  allow  one  to  predict  the 
eventual  discovery  of  a  constructive  treatment  of  the  problem. 
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II.  FRnfCIfifkL  pivonno 

Consider  a  p -square  matrix  M  and  a  p- vector  q.  For  any  p-vector  z  the 
expression  q  +  Mz  defines  a  mapping  W:RP  — ►  R5  and  we  let 
(6)  w  »  W(z)  ■  q  +  Mz 

*fe  think  of  (6)  as  a  system  of  p  linear  equations  in  2p  variables,  and  in  the 
*orm  above,  the  variables  z  are  Independent  vhile  the  variables  v  are  dependent. 

.r  the  terminology  of  linear  programming  [9],  the  independent  variables  are 
lpnbaslc  and  the  dependent  variables  are  basic. 

a  solution  (v,z)  to  equation  (6)  is  said  to  be  nondegenerate  If  at  most 
p  of  the  2p  components  w^, . . . ,w^,z^,. . . ,z^  equal  zero. 

To  pivot  in  (6)  or  cm  equivalent  system  is  to  solve  for  a  currently 
lonbasic  variable  in  terms  of  the  remaining  nonbasic  variables  and  one  of  the 
basic  variables.  Thus  pivoting  exchanges  the  roles  of  two  variables  with  re¬ 
spect  to  membership  in  the  basis.  The  specification  of  these  variables  single; 
out  a  particular  entry  in  the  matrix  of  columns  corresponding  to  the  nonbasic 
variables,  and  this  entry  is  called  the  pivotal  entry.  For  the  operation  tc  <o 
-gttimate,  it  is  necessary  and  sufficient  that  the  pivotal  entry  be  nonzero. 

More  generally,  a  block  pivot  in  (6)  or  an  equivalent  system  consists 
-  solving  for  a  set  of  k  currently  nonbasic  variables  in  terms  of  the  remain- 
ng  p  -  k  nonbasic  variables  and  a  set  of  k  basic  variables.  For  this  operat. 
f.o  be  possible,  it  is  necessary  and  sufficient  for  the  corresponding  pivotal 
clock  (submatrix)  to  be  nonsingular.  A  block  pivot  in  (6)  is  called  a  pr Inc, 
r.ii  pivot  if  the  pivotal  block  is  a  principal  submatrix  of  M. 

The  variables  wi,z1  (i  -  l,...,p)  are  said  to  a  v  *aplemcntary  pair 
,ii  each  in  the  complement  of  the  other.  The  set  of  basic  variables  is  said 
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to  be  complementary  (almost -complementary )  if  it  sontaine  no  (exactly  one) 
complementary  pair. 

If  the  system 


.  .  *  *  #  # 

(7)  v  ■  q  +  M  a 

is  obtained  from  (6)  by  a  principal  pivot,  it  is  possible  to  rearrange  the 
rows  and  columns  so  that  is  a  complementary  pair  for  each  1  »  l,..,,p. 

For  the  sake  of  the  discussion  below,  we  assume  that  this  is  always  done.  We 
call  (7)  a  principal  transform  of  (6).  If  P  is  a  permutation  matrix  of  the 
same  order  as  M,  then  the  congruent  matrix  P^TtP  is  called  a  principal  rearrange- 
ment  of  M. 


If  we  let  9  *=  {  1,. .  .  ,pj  and  \f.  ,  ^  @  , 


then  M 


denotes  the 


submatrix  of  M  formed  by  deleting  the  the  entries  except  the  m^  for  which  (i, J ) 
belongs  to  \JL  X  ^  .  ihus  ^  is  a  principal  suboatrlx  of  N  and  its  deter¬ 

minant  is  called  a  principal  minor  of  M.  It  is  a  standard  convention  to  define 
the  determinant  of  tne  empty  matrix  to  be  1. 

* 

It  is  clear  taat  when  M  is  a  p -square  matrix  and  M  is  a  principal  re¬ 
ft 

arrangement  of  M,  every  principal  suboatrlx  of  M  is  a  principal  submatrix  of 
M.  Consequently,  principal  rearrangement  preserves  the  character  of  principal 
minors.  Also  clear  is  the  fact  that  if  M  is  positive  semi -definite,  so  is  any 
of  its  principal  rearrangements. 

Using  the  notation  and  definitions  above,  we  may  now  state  a  result  of 


Tucker  [25]. 

# 

IHBOKEN  1.  If  N  has  positive  principal  minors  and  N  is  a  principal  transform 

of  M,  then  N  has  positive  principal  minors. 

* 

IROOF.  Suppose  N  is  obtained  from  M  by  a  block  pivot  on  the  principal  suboatrlx 
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MJ4  .  Then  the  conclusion  follows  free  another  result  of  Ticker  [22,  Theorem  J] 
which  implies  that  for  all  ^ 


(8) 


det  M, 


*  det  M 


/  det  M. 


U  My 

where  A  represents  the  symmetric  difference  operation: 

4a}  -  (4  <j  }  )  -  (4  n  }  ) 

On  the  strength  of  this  theorem,  we  say  that  the  class  of  p-equare  matrices 
having  positive  principal  minors  is  invariant  under  principal  pivoting.  The 
formula  (8)  yields  a  similar  invariance  theorem  for  the  class  of  matrices  with 
nonnegative  principal  minors. 

The  class  of  positive  (semi -)def ini te  matrices  of  a  given  order  is  also 
invariant  under  principal  pivoting. 

THEOREM  2.  If  M  is  a  p-square  positive  (semi-)definite  matrix  and  M  is  a 
principal  transform  of  M,  then  M*  is  positive  ( semi -)def ini te. 

PROOF.  We  may  assume  that 


„ .  ( H44  %<} 

‘V 


and  that  M  is  obtained  from  M  by  a  block  pivot  on  the  principal  submatrix 
m<M  If  z  m  (zj  ,s^  )  and  w  ■  (v^  ,v^  )  are  defined  conformally,  we 

may  write 


The  quadratic  form  z^Mz  can  then  be  expressed  as 


’4  '  m4,4  %  +MJ,}  > 

i  *  MJ/4  \»  *  V*  zy 


I  ’me 


T 

z  a  v. 


T 

+  z;  v. 


4  ’4  T  >  > 

After  the  block  pivot  on  M.  .  we  obtain 

Jt.aL 
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l4  3  V  ^ 


"  *)4  m44  V  +  (MH  '  V  *k  >  )i:f 


and  hence 


# 

M  a 


M 


-1 


Vm£  “).}  -  V!mk-*> 

Hie  associated  quadratic  form  is  given  by 


T  T  T 

V»  z,  +  z;  w*  «  Zi  w. 


Therefore,  the  range  of  the  quadratic  form  is  invariant  under  principal  pivoting. 

* 

When  M  is  positive  semi -definite,  M  must  be  also  as  well.  If  M  is  poeltlve 
definite,  the  quadratic  form  z^Mz  is  nonnegative  for  axl  z  and  vanishes  only 
when  z  =  (z.  ,z  )  *  0.  Therefore  if  (w*  ,z.  )V(wj  ,z4  )  vanishes,  it 

*  V  *  r  J  r. 

follows  from  the  relations  above  that  (w.  ,z )  •  0.  Hence  N  is  positive 

4 

definite . 


REMARK.  It  should  be  carefully  noted  that  no  assumption  of  sysswtry  on  M  has 
been  used.  For  the  application  suggested  in  Section  I,  it  would  be  inappropriate 
to  be  hampered  by  such  a  restriction. 

As  we  shAll  see,  the  principal  pivoting  method  for  solving  (5)  relies 
rather  heavily  on  these  invariance  theorems.  The  method  consists  of  a  finite 
sequence  of  principal  (block)  pivots.  Bach  such  block  pivot  which  is  not  a 
"simple"  principal  pivot  amounts  to  a  finite  sequence  of  simple  nonprincipal 
pivots  each  of  which  produces  an  almost -complementary  set  of  basic  variables 
in  the  current  transform  of  (6).  This  means  that  properties  of  positive  princi¬ 
pal  minors  or  positive  semi -definiteness  can  temporarily  be  lost.  It  will 
becosw  clear  in  the  sequel  that  the  following  facts  are  helpful. 
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THEOREM  3*  Let  A  -  (ft^ )  be  a  2 -square  positive  teal -definite  matrix.  If 
an  -  0,  then  +  *2i  "  °* 

PROOF.  The  associated  quadratic  fora  is 

<*12  *  *21JV2  **22^i° 

for  all  x^Xg.  I*  +  *21  ls  “JFthlng  but  zero,  the  inequality  cannot  hold 
for  all 


THEOREM  4.  Let  A  *  (a^)  be  a  2-square  matrix  having  the  properties: 

(1)  au  <0; 

(H)  *22  £  °» 

(Hi)  au  +  *21  K  0i 

(iv)  if  a^  <  0,  then 


11 


1/a 


11 


(*11*22  *  *12*21^*11  *21^* 


Is  positive  seal -definite; 
(v)  if  agl  <  0,  then 


11 


*2  * 


*u/*2i  (*12*21  '  *11*22  ^*21^ 


1/*21  ”*22^*21 
Is  positive  seal -definite. 
Then  A  must  have  the  properties: 


(vi)  a^  >  0; 

(vH)  *22  ^  °* 

(vill)  a^  ♦  agg  >  0. 


-  11 


PROOF.  Hie  first  thiee  properties  of  A  imply  that  a^  <  0  or  ag,  <  0— And 
perhaps  both  are  negative. 

CASE  I.  If  a^  <  0,  the  principal  minore  of  A^  are  nonnegative  since  it 

* 

is  positive  Geni-definite .  Li  particular,  a^  >  0  and  Sgj,  >  0.  If  both  were 
zero,  it  vould  follow  from  Theorem  3  that  1/a^  »  0  which  is  absurd.  Bence 
the  properties  (vi)  to  (viii)  hold. 

CASE  II.  If  a?1  <  0  and  ■  0,  the  positive  semi -definiteness  of  Ag 
implies  a  *  *1/a21  >  0  and  a22  >  Hence  the  required  conditions  hold  again. 

The  theorem  can  be  varied  and  established  even  more  easily  for  the  case  of 
matrices  having  positive  principal  minors.  The  proof  is  CM&tted. 
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III.  SOME  FBOPERTIES  OF  THE  FUCMKOrCHI-  SYSTEM 
The  linear  Inequalities 

(9)  q  +  Mz  >  0 

z  >  0 

or  equivalently, 

(9')  v  •  q  SsMz 

v  >  0 
z  >  0 

will  be  called  the  fundamental  -system.  For  the momaht,  we  will  assume  only 
that  M  is  a  p-square  matrix,  q  £  R*1,  z  £  R^. 

It  is  a  straightforward  consequence  of  Farkas'  Theorem  [13  ]  that  (9) 
has  no  solution  if,  and  only  if,  there  exists  a  p-vector  v  satisfying 

(10)  A  <0,  yTq  <  0,  v  >  0 

It  has  been  shown  [3)  that  when  M  has  positive  principal  minors,  (10)  has  no 

solution,  and  hence  (9)  is  consistent  regardless  of  what  q  may  be.  Moreover, 
it  was  shown  there  that  In  this  case,  the  fundamental  problem  rlvays  has  a 
solution;  the  uniqueness  of  the  solution  was  first  pointed  ait  by  Ingleton 
[l8 )  and  by  the  author  in  [7).  The  key  ingredient  in  the  proof  is  the  fact, 
due  to  Qale  and  Hikaldo  [13],  that  M  has  positive  principal  minors  if,  and  only 

(11)  x1(Kz)1  <0,  1  -  l,...,p  implies  1  *  0 

In  the  positive  semi -definite  case,  the  fundamental  system  need  not  be 
consistent,  but  Shan  it  is,  the  fundamental  problem  (3)  has  a  solution.  If 
the  solution  in  also  nondegenerate,  it  is  unique.  9ee  [fc]  (existence)  and 
[20}  (uniqueness). 
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Hie  principal  pivoting  method  ia  applicable  to  fundamental  problems  (5) 
in  which  M  has  positive  principal  minors  or  ia  positive  seal -definite.  It 
works  with  solutions  of  w  =  q  +  Mz  rather  than  with  solutions  of  the  full 
system  (9' ).  Hence  no  a  priori  information  regarding  feasibllltjr/need  be  given. 
For  this  reason,  the  method  incorporates  a  devise  for  recognising  infeasibility 
when  it  occurs  in  the  positive  semi -definite  case.  The  device  rests  on  Theorem  4 
and  the  following  result  proved  in  [10]  by  Dantzig  and  the  author. 

THEOREM  5*  Let  M  be  a  p -square  positive  semi -definite  matrix,  and  let  q6  R^. 

If  for  some  index  r,  1  <  r  <  p, 

(1)  <o 

(ii)  mrr  =  0 
(ill)  mlr  >  0 

the  system  (9')  has  no  solution. 

This  is  proved  by  noting  that  the  hypotheses  lead  via  Theorem  5  to  the 
conclusion  that  the  r-th  equation 

"r-Vg  "rJzJ 
* 

can  have  no  nonnegative  solution. 

Although  it  seems  unimportant  from  the  computational  standpoint,  it  Is 
interesting  to  see  that  for  a  Large  class  of  matrices,  the  solutions  to  (9) 
must  fora  either  an  empty  or  an  unbounded  set. 

*  It  is  worth  mentioning  that  Theorems  3  and  5  are  valid  for  the  class  of 
"copositive  plus"  smitrlces  introduced  by  Leake  (20,  Theorem  k).  However,  our 
Theorem  2  is  not  valid  for  this  class  of  matrices,  and  thereto  lies  a  limita¬ 
tion  of  the  principal  pivoting  method. 
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THEOREM  6.  Let  M  be  a  p-square  matrix  and  let  q  €  R**.  The  set 
Z(q,M)  =  [z  €  R1*  I  9  +  M*  >  0,  *  >  0^ 
is  unbounded  if,  and  only  if,  it  is  nonempty  and 
(12 j  v^i  <0,  w  >  0 

has  no  solution. 

ER00F.  Suppose  Z(q,M)  is  nonempty.  By  a  theorem  of  Goldman  [l6j,  Z(q,M)  is 
unbounded  if  and  only  if  Z(0,M)  /  .  But  by  a  standard  alternative  theorem 

(see  Gale  [14,  Theorem  2.10]),  this  is  so  if  and  only  if  (12)  has  no  solution. 

Hie  class  for  which  (12)  has  no  solution  includes  copositive  matrices 
(i.e.,  those  for  which  z*Mz  >  0  for  all  z  >  0)  and  adequate  matrices  (as  defined 
by  Ingleton  [18  ]);  therefore  the  class  includes  all  positive  semi -definite 
matrices  and  all  those  with  positive  principal  minors. 

As  an  application  of  Theorem  6,  consider  the  interpretation  of  (9)  as  the 
set  of  constraints  obtained  by  talcing  a  convex  quadratic  program  and  composing 
its  constraints  with  those  of  its  dual,  as  done  in  Section  I.  If  aifcher  the 
primal  or  the  dual  constraint  set  is  nonempty,  then  at  least  one  of  the;i  must 
be  unbounded .. This  generalizes  results  of  Clark  [2],  Charnes,  Cooper,  and 
Thompson  [1  ] ,  and  Lemke  [20 ] . 
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IV.  THE  PRINCIPAL  PIVOTING  METHOD 

In  this  section,  we  shall  present  a  treatment  of  the  principal  pivoting 
method  first  proposed  by  Dantzig  and  the  author  [10].  The  method  is  applicable 
to  mt.  ..rices  with  positive  principal  minors  (sc-called  P-aatrices)  and  to  those 
which  are  positive  semi -definite.  Since  the  method  can  be  stated  more  simply 
for  the  class  of  P-matrices,  we  begin  there  and  subsequently  broaden  the  dis¬ 
cussion  to  the  positive  semi -definite  case.  First  tnough,  we  make  sate  general 
remarks. 

It  Is  convenient  to  represent  (6)  in  tabular  form  as  follows: 

1  z,  ...  z 


£ 


wl“ 

ql 

“ll 

•••  % 

W2  " 

• 

*2 

• 

“21 

• 

®2p 

• 

• 

* 

n 

V 

• 

•  •  •  Opp 

The  variables  to  the  left  of  the  box  are  basic  and  those  above  it  are  nonbaslc. 

The  number  of  negative  components  in  q  is  called  the  index  for  the  funda¬ 
mental  problem  (5).  If  the  index  is  zero,  then  q  >  0  and  z  »  0  solves  (5). 
Obviously  (5)  has  no  computational  interest  unless  its  index  is  positive.  Then, 
if  we  can  achieve  an  equivalent  system  with  an  index  of  zero,  the  original  system 
is  readily  solved. 

We  denote  by 

(6.V)  .<">  .  ,<*)  +  mK(v) 

the  system  obtained  from  (6.0)  after  V  iterations .  The  system  (6.0)  id  Just  (6). 
We  say  that  (6.V)  is  complementary  if  (w^,z^)  *  »  for  *  *  l*.*.*p. 


Starting  from  a  complementary  system  (6.V)  with  a  positive  index,  we  consider 
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/ 

the  solution  (w^,z^)  *  (q^,0)  and  select  a  particular  negative  component, 

w<v) ,  which  will  be  called  the  distinguished  variable .  Our  isnedlate  objective 

is  to  make  increase  to  zero  without  allowing  any  variable  already  nonnegative 
r 

to  became  negative.  In  the  case  of  P-matrices  which  we  now  consider,  this  can 
always  be  accomplished  by  a  finite  number  of  simple  pivots  which  result  in  a 
principal  (block)  pivot. 

Notice  that  if  M  =  is  a  P -matrix,  so  is  in  any  equivalent  comple¬ 
mentary  system  (6.v),  i.e. ,  in  any  principal  pivotal  transform  of  (6.0).  Conse* 
quently,  the  diagonal  entries  of  are  positive,  and  in  particular, 

av(’V(v)  -  >  o 

r  '  r  rr 

Verbally,  the  increase  of  the  nonbasie  complement  z^  of  the  distinguished 

r 

basic  variable  w^  forces  the  latter  to  increase.  In  this  role,  tty  is  called 
r  r 

to) 

the  driving  variable .  Increasing  the  variable  z^;  '  to  the  positive  value 
drives  w ^  up  to  the  value  zero. 

Basis  Exit  Rule.  The  driving  variable  is  governed  by  a  rule  which  states  that 
its  increase  must  stop  as  soon  as  a  positive  basic  variable  decreases  to  zero  or 
the  distinguished  variable  increases  to  zero.  The  variable  which  limits  the 
increase  of  the  driving  variable  is  called  the  blocking  variable .  In  this  case, 
the  existence  of  the  blocking  variable  is  clear;  nondegeneracy  guarantees  Its 
uniqueness. 

The  blocking  variable  w^  and  the  driving  variable  determine  a  pivotal 
entry  in  the  tableau  corresponding  to  the  basis  exchange  in  which  the  driving 
variable  jwplaces  the  blocking  variable.  Biere  are  two  cases.  Case  A.  If  s  »  r, 
then  the  exchange  of  wj^  for  z ^  is  a  principal  pivot  which  produces  an  equi¬ 
valent  complementary  system  (63M)  In  which  is  a  P-matrix.  Moreover, 


(6: VH )  baa  a  lover  Index  than  ( 6.V )•  If  Bin  '  1 '  <  0,  a  new  dlatlnfulohed 

441) 

variable  Is  determined  and  the  process  is  repeated.  It  aln  7  >  0,  a  solu¬ 
tion  of  the  fundamental  problem  Is  at  hand,  vis.,  •  (q^+^,0). 

Case  B.  If  s  4  T>  then  m^  <  0,  and  the  exchange  of  and  Is  e  non- 

sr  s  r 

principal  pivot.  Before  the  pivot, 


mM 

m(y) 

a 

rr 

“rs 

.M 

sr 

ss 

of  the 

P-oa- 

fy) 

i.i  a  principal  2-square  suboatrlx  of  the  P-oatrlx  Vl  .  Am  such,  It  Is  a  P 

(Vi 

litrlx.  After  the  pivot  on  m^.7  this  2-square  matrix  becomes 

t  U)  /,«/>>  (.<*).(*’)-  .W^^W  \ 

'  1  I  TT  fir  y  v*a  nr  rr  mm  1 


i  rr  '  sr  '  ra  sr  rr  as  1 

V  1*M  H.W/.O')  ) 

\  •  sr  ss  7  sr  / 

.0  which  Theorem  V  applies,  although  in  this  case,  Its  conclusion 

(15)  (m^m^  -  >  0,  -m^fym^  >  0 

'  7  '  rs  sr  rr  ss  77  sr  *  ss  7  sr 

la  obvious  from  first  principles.  Bse  pivot  has  (l)  left  tile  distinguished 
variable  basic  at  a  negative  value,  and  (ii)  made  the  driving  variable  basic 
and  the  blocking  variable  non  basic.  Bov,  the  system  is  almost  -complementary 

since  the  distinguished  variable  and  its  complsamnt  are  basic  while  the  blocking 

## 

variable  and  its  complement  are  nonbasle.  . 

One  ssklidnt  feature  of  the  2-square  matrix  (lU)  is  that  its  rows  correspond 
to  the  basic  pair  and  its  columns  correspond  to  the  nonbasle  pair  in  the  current 
tableau.  We  shall  call  it  the  pair  matrix.  The  pair  matrix  lit  defined  only  for 
an  almost -ccmplenentary  system. 


e 


*  The  Donbas i c  t-qlr . 


Bas*8  Entry  Rule .  The  next  variable  tc  enter  the  basis  (i.e.,  the  next  driving 
variable)  Is  the  complement  of  .the  blocking  variable  vfaich  Just  became  nocbaala. 
Its  increase  Is  also  governed  by  the  basis  exit  rule  above.  Notice  that  (15) 
Implies  that  both  members  of  the  basic  pair  will  increase  with  increases  in  the 
new  driving  variable.  Hence  the  existence  of  a  blocking  variable  is  assured. 
Indeed,  vhen  M  is  a  P-matrlx,  the  distinguished  variable  is  alvays  potentially 
a  blocking  variable.  If  the  distinguished  variable  is  not  actually  the  block¬ 
ing  variable  at  a  particular  iteration,  the  pivotal  entry  is  again  negative. 

By  the  algebra  of  pivoting,  it  follows  that  a  new  pair  matrix  is  obtained  to 

■g. 

which  Theorem  4 4  applies.  Part  of  the  applicability  of  Theorem  4'  stems  from 
the  invariance  of  P-matrices  under  principal  pivoting.  The  constant  applica¬ 
bility  of  Theorem  4'  accounts  for  the  fact  that  both  members  of  the  basic  pair 
will  increase  as  the  current  driving  variable  increases.  Since  nondegeneracy 
implies  that  strictly  positive  increases  of  the  driving  variables  are  alvays 
allowed,  the  procedure  must  drive  the  distinguished  variable  up  to  zero  at  which 
time  it  is  the  blocking  variable;  the  corresponding  pivot  restores  the  comple¬ 
mentarity  of  the  system. 

As  in  linear  programing,  the  sequence  of  steps  by  which  the  distinguished 
variable  is  driven  to  zero  is  finite.  For  there  are  only  finitely  many  basic 
solutions,  each  of  which  corresponds  to  a  unique  set  of  values  of  the  basic  vari¬ 
ables.  Since  the  distinguished  variable  and  its  complement  increase  strictly 
from  one  iteration  to  the  next,  no  basis  can  be  repeated.  The  finitemess  of  the 

overall  procedure  now  follows  from  the  fact  that  the  index  never  increases. 

*  In  this  application,  khte  matrices  and  Ag  are  interpreted  as  pivotal  trans- 

of  the  pair  matrix  which  would  make  thorn  principal  submatriees  of  a  P-oatrix. 


Modifications  for  the  positive  semi -definite  case.  When  the  matrix  M  in  the 
fundamental  problem  (5)  is  positive  semi -definite,  its  principal  minors  are 
nonnegative  rather  than  strictly  positive.  This  causes  certain  complications 
which  call  for  special  handling. 

Indeed,  the  system  (6)  need  not  have  a  nonnegative  solution  in  this  case. 

In  the  modified  procedure,  the  absence  of  an  appropriate  pivoted  element  or, 
equivalently,  the  existence  of  an  unblocked  driving  variable  detects  this 
possibility.  Ihis  is  accomplished  with  Theorems  5>  and  2.  However,  it  is 
necessary  to  incorporate  an  artifice  to  handle  the  situation  in  which  the 
driving  variable : 

(i)  has  no  effect  on  the  negative  distinguished  variable; 

(ii)  makes  at  least  one  other  negative  basis  variable  decrease; 

(ill)  makes  no  positive  basic  variable  decrease. 

Without  modification  these  conditions  would  signal  an  unblocked  driving  variable, 
and  we  want  this  situation  to  indicate  that  no  solution  to  the  fundamental  problem 
exists.  Hie  artifice  we  use  is  to  impose  a  lower  bound  ^  <  min  on  all 

negative  basic  variables.  A  negative  variable  can  therefore  block  the  driving 
by  decreasing  to  its  lower  bound;  if  this  happens,  the  variable  becomes  non- 
basic  at  its  lower  bound  value,  JJ  .  However,  once  a  variable  becomes  nonnega¬ 
tive,  zero  is  its  lower  bound. 

Hiis  modification  necessitates  a  change  in  the  notion  of  basic  solution. 

A  basic  solution  now  is  one  in  which  the  nonbaslc  variables  are  set  at  their 
current  lower  bound  values,  either  0  o rfi  .  A  solution  is  nondegenerate  if  at 
most  p  of  the  2p  variables  of  the  problem  equal  0  or  |3  .  Again  the  nondegener¬ 
acy  of  all  solutions  is  assumed. 


The  basis  entry  and  exit  rules  for  tlie  positive  semi -definite  case  are 
as  stated  above.  However,  It  could  hippen  that  the  distinguished  variable  is 


also  the  nonbasic  driving  variable,  and  in  this  case,  it  could  be  self -blocking. 

The  principal  pivoting  method  for  both  of  the  cases  discussed  above  can 
be  sumaarized  by  the  following  diagram. 


FLOW  DIAGRAM  FOR  THE  PRINCIPAL  PIVOTING  METHOD  OF  QUADRATIC  PROGRAMMING 
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As  before,  the  procedure  does  not  allow  nonnegative  variables  to  became 

negative.  Bach  return  to  a  test  of  the  current  "q-column"  corresponds  to  a 

# 

complementary  system  with  lower  index  them  its  predecessor.  Hence  only 
finitely  many  returns  eire  possible. 

It  remains  to  show  that  after  each  nonterminating  q-test,  a  return  to  the 
q-test  will  occur  after  finitely  many  steps  unless  the  driving  variable  is  unW 
blocked,  in  which  case  the  problem  cannot  be  solved  due  to  the  infeaslbllity 
of  (9).  As  stated  earlier,  these  facts  are  attributable  to  the  nondegeneracy 
assumption  and  Theorems  5,  4,  and  2. 

Suppose  we  start  from  a  complementary  system.  If  the  driving  variable  Is 
unblocked.  It  cannot  also  be  the  distinguished  variable,  since  a  distinguished 
driving  variable  must  be  a  negative  nonbasic  variable  which  would  not  be  in¬ 
creased  beyond  zero.  Therefore  an  unblocked  driving  variable  In  a  complementary 
system  has  a  negative  basic  complement  upon  which  Its  increase  has  no  effect. 
Being  unblocked,  the  driving  variable  must  correspond  to  a  nonnegative  column 
in  the  current  tableau.  Hence  conditions  (i),  (11 ),  and  (ill)  of  Theorem  5 
hold  and  the  system  has  no  nonnegative  solution. 

If  the  driving  variable  is  blocked  but  not  by  Itself  or  Its  complement, 

the  first  system  after  the  pivot  Is  almost -complementary  and  contains  a  pair 

matrix.  Whenever  an  almost -complementary  system  Is  obtained,  the  most  recent 

pivotal  element  is  negative.  The  invariance  theorems  guarantee  the  required 

properties  on  the  matrices  A^,  Ag  In  Theorems  4  and  4'. 

*  It  was  pointed  out  by  T.D.  Persons  In  a  private  communication  that  It  is 
possible  to  return  to  the  q-test  and  find  that  column  nonnegative  while  sake  of 
the  nonbasic  variables  are  at  negative  values.  In  such  a  case,  setting  all 
basic  variables  equal  to  0  solves  the  problem. 
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If  the  driving  variable  in  an  almoat-ccmplementary  system  is  unblocked, 

Its  corresponding  column  In  the  current  tableau  must  be  nonnegative  (for  other- 

vise  some  basic  variable  vould  block  it).  Since  the  distinguished  variable  is 

not  blocking,  the  entry  of  the  pair  matrix  corresponding  to  the  driving  variable 

* 

and  the  distinguished  variable  is  zero,  and  the  other  entry  in  that  column  of 
the  pair  matrix  is  positive.  Pivoting  on  the  latter  restores  complementarity 
to  Vto  system.  After  suitable  reordering  of  the  columns,  it  is  possible  to 
apply  Theorem  5  and  declare  infeasibility. 

Each  iteration  of  the  method  Increases  the  sum  of  the  distinguished 
•ariable  and  its  complement,  because  the  nondegeneracy  assumption  implies  that 
driving  variable  can  always  be  increased.  Since  there  are  only  finitely 
via:v  bases  and  finitely  many  basic  solutions  corresponding  to  e%ch,  it  is  im¬ 
possible  to  return  to  a  previously  encountered  basic  solution,  and  therefore 
only  finitely  many  steps  are  required  to  detect  infeasibility  or  produce  a 
complementary  system  with  lower  index  than  the  previous  one. 


■*  This  could  not  occur  in  the  case  of  P-matrices. 
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V.  A  COMPARISON  WITH  LBMKE'S  METHOD 

The  principal  pivoting  method  invites  comparison  with  the  very  interesting 
approach  of  Lemke  [20].  Leake  'b  method  can  be  viewed  as  a  sequence  of  almost- 
complementary  pivots  resulting  in  one  grand  principal  pivot  which  is  completely 
determined  after  the  fundamental  problem  (5)  is  embedded  in  a  larger  one: 


T 

where  e  «  (l,...,l),  ^  is  a  suitably  large  scalar,  and  w0,»Q  is  *  pair  of 
complementery  artificial  variables.  A  solution  of  (16)  in  which  aQ  ■  0  is 
clearly  a  solution  of  \be  original  fundamental  problem  (5). 

Leake  has  shown  that  (l6)  has  a  solution  for  any  p-square  matrix  M.  The 
question  then  becomes  the  significance  of  a  solution  to  (5)  in  which  >  C 
and  hence  wQ  -  0.  For  a  large  class  of  matrices  which  includes  the  positive 
semi -definite  cles^  the  answer  is  that  the  fundamental  problem  (5)  has  no 
feasible  solution.  The  author  and  Dantxig  [8]  have  shown  that  (1 6)  has  no 
solution  with  *q  >  0  when  M  is  a  P-aatrix.  Hence  Leake's  method  can  always 
solve  (3)  when  M  is  a  P-aatrlx. 

As  mentioned  in  an  rrlier  section,  the  principal  plvotli^  method  is  not 
applicable  to  the  entire  clase  of  copositive  plus  matrices  introduced  by  Lemke 
since  that  class  is  not  invariant  under  prisicipal  pivoting.  It  the  author's 
knowledge,  the  two  methods  have  never  been  systematically  compared  os  data  to 


which  they  are  both  applicable. 


The  Lemke  procedure  begins  with  a  noaprincipal  pivot  in  the  column. 
;J1  subsequent  systems  except  the  Ip  jt  are  almost -camplementa.y  and  contain  ti 
basic  pair  wo'zO"  Wll®n  M  is  P°si,tive  seal  “definite,  so  is 


\e  H  j 

and  Theorem  4  can  be  applied  to  show  that  w^  and  zn  are  nonincreasing  while 
Wq  +  zQ  is  strictly  decreasing  witn  increases  cl  successive  driving  variable; 
However,  this  result  does  not  hold  for  arbitrary  matrices. 
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VI.  APPENDIX 

It  is  well  known  that  an  equation  is  equivalent  to  a  pair  of  inequalities. 
Therefore,  the  system  of  linear  equations 

(17)  Ax  =  b 
earn  always  be  written  as 

(18)  Ax  >  b 

-Ax  >  ~b 

This  doubles  the  number  of  linear  constraints  satisfied  by  x.  If  a  system  of 
linear  inequalities  equivalent  to  (17)  is  desired,  a  smaller  system  than  (18) 
can  be  used.  (Actually,  the  system  will  be  smaller  only  when  m  >  1. )  Since 
(17)  is  Just 


(17’) 

it  is  equivalent  to 

(19) 


11 

/  a.  .x.  =  b.  i  =  l....,m 
t— :  i.1  .1  1  *  * 


J=l 


n 

?  A*  <x,,  >  1  =  1, •••,m 

1=1  J  “ 


n  m  m 

kL  ^*^aii^xi  —  *^“^i 
j-i  i=i  J  J  i=i 

which  is  a  system  of  m  +  1  linear  inequalities. 

Another  often-used  fact  is  that  any  real  number  ^  can  be  represented  as 
the  difference  of  two  nonnegative  real  numbers; 

%  -  f >0,  >  0 

Thus,  variables  which  are  not  sign  restricted  can  be  represented  by  the  difference 
of  two  nonnegative  variables.  In  a  problem  such  as  (l')  where  all  the  variables 
are  unrestricted  in  sign,  this  device  would  double  the  number  of  variables.  This 
duplication  is  also  unnecessary.  It  suffices  to  write 


-  *q  ,  *q  >  °»  xj  >  0, 


1,. . .  ,n 


This  Increases  the  number  of  variables  by  1. 
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PREFACE 


In  this  Memorandum,  some  basic  problems  concerning  flow 
networks  are  surveyed  and  extended  to  two  more  general 
structures:  frames  of  real  vector  spaces  and  blocking 
systems . 


SUMMARY 


This  paper  surveys  some  basic  problems,  theorems  and 
constructions  for  flow  networks,  and  shows  how  these  can  be 
extended  to  more  general  combinatorial  structures. 

One  of  the  generalizations  can  be  roughly  described 
as  that  obtained  by  replacing  the  vertex— edge  incidence 
matrix  of  an  oriented  network  by  an  arbitrary  real  matrix. 
This  leads  to  the  notion  of  a  frame  of  a  subspace  of 
Euclidean  n-space,  a  concept  very  closely  allied  to  that 
of  a  real  matric  matroid.  Our  treatment  relates  matroid 
theory  and  linear  programming  theory,  and  thus  provides 
another  viewpoint  on  linear  programming,  and  in  particular, 
on  digraphoid— programming. 

In  the  last  part  of  the  paper  a  very  general  combina¬ 
torial  structure  called  a  blocking  system  is  given  an  axio¬ 
matic  formulation.  These  systems  have  arisen  in  a  variety 
of  contexts,  including  multi-person  game  theory  and  abstract 
covering  problems.  It  is  shown  that  one  of  the  network 
theorems  surveyed  in  the  first  part  of  the  paper  extends 
to  all  blocking  systems,  and  indeed  characterizes  such 
systems . 
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NETWORKS,  FRAMES,  BLOCKING  SYSTEMS 


INTRODUCTION 

In  this  paper  we  survey  a  few  basic  problems,  theorems, 
and  constructions  concerning  flow  networks,  and  describe 
how  some  of  these  can  be  extended  to  more  general  structures. 

The  paper  is  divided  into  three  parts. 

Most  of  the  material  of  Part  I,  which  deals  with 
networks,  can  be  found  in  Ford  and  Fulkerson  [8],  or  in 
earlier  papers  by  the  same  authors.  In  the  main,  we  limit 
the  discussion  in  Part  I  to  four  network  problems:  maximum 
flow,  minimum  path,  maximum  capacity  path,  and  the  length- 
width  inequality. 

Part  II  extends  this  discussion  to  arbitrary  real 
matrices  by  making  use  of  what  we  call  the  frame  of  a 
subspace  of  Euclidean  n-space,  a  notion  very  closely  related 
to  that  of  a  real  matric  matroid.  In  particular.  Part  II 
can  be  specialized  to  a  subclass  of  real  matric  matroids 
introduced  and  studied  by  Tutte  [1/1],  and  called  by  him 
regular  matroids.  Regular  matroids  have  been  recently 
re- investigated  by  Minty  [24],  who  has  given  another  system 
of  axioms  for  a  dual  pair  of  regular  matroids.  The  resulting 
structure  is  called  a  digraphoid  in  [24],  where  it  is  shown 
that  some  of  the  main  theorems  of  network-programming 
generalize  to  digraphoid-programming.  Our  treatment  provides 
another  viewpoint  on  digraphoid-programming,  and  indeed  on 
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linear  programming  in  general.  It  is  shown  in  Part  II  that 
the  main  theorems  of  Part  I  have  direct  analogues  for 
arbitrary  real  matrices.  We  want  to  emphasize,  however, 
that  the  special  network  algorithms  of  Part  I  do  not,  so 
far  as  we  know,  have  such  analogues.  Fven  for  the  case  of 
digraphoid-programming,  we  know  of  nothing  better 
computationally  than  the  simplex  method  of  Dantzig  [ 3 ] - 
While  the  simplex  method  has  proved  to  be  a  powerful  tool, 
both  theoretically  and  computationally,  it  is  not  yet  known 
whether  it  is  a  good  algorithm,  in  the  technical  sense 
stressed  by  Edmonds  [6],  whereas  the  network  algorithms  of 
Part  I  are  good  in  this  sense. 

In  Part  III  a  very  general  combinatorial  structure, 
which  we  call  a  blocking  system,  is  given  an  axiomatic 
formulation.  These  systems  have  arisen  previously  in  a 
variety  of  contexts,  including  multi-person  game  theory 
[29]  and  abstract  covering  problems  [14,  21,  22].  They 
have  recently  been  studied  by  Lehman  [22],  who  has  given 
conditions  on  a  blocking  system  in  order  that  a  max-flow 
min-cut  equality  or  a  length-width  inequality  hold,  and 
also  by  Edmonds  and  Fulkerson  [7],  who  have  shown  that  one 
of  the  network  theorems  of  Part  I  extends  to  all  blocking 
systems,  and  indeed  characterizes  such  systems. 
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PART  I •  NETWORKS 

1.  MAXIMUM  FLOW 

Let  G  be  a  graph  with  edge  set  E  and  vertex  set  V. 

Both  E  and  V  are  assumed  finite.  The  two  ends  of  an  edge 
may  be  distinct  vertices  or  the  same  vertex;  in  the  latter 
case  the  edge  is  frequently  called  a  loop.  We  also  allow 
multiple  edges  joining  the  same  pair  of  vertices,  or 
multiple  loops  on  the  same  vertex- 

It  will  be  convenient  in  this  section  to  orient  G  by 
distinguishing  one  end  of  each  edge  as  positive  and  the 
other  as  negative.  For  a  loop  these  coincide*  If  e  e  E 
has  positive  end  u  c  V,  negative  end  v  ».  V,  we  sometimes 
write  e  (u,  v) .  For  each  edge  e  c  E  and  vertex  v  t  V  we 
define  an  integer  a(v,  e)  as  follows.  If  v  and  e  are  not 
incident,  or  if  e  is  a  loop,  then  a(v,  e)  *  0.  Otherwise 
a(v,  e)  *  1  or  -1  according  as  v  is  the  positive  or  negative 
end  of  e.  We  call  the  resulting  matrix  the  vertex-edge 
inc idence  matrix  of  G. 

Suppose  now  that  each  edge  e  «.  E  has  associated  with 
it  a  nonnegative  real  number  c(e),  the  capacity  of  e-  Let 
s  and  t  be  two  distinguished  vertices  of  G-  A  (feasible) 
flow,  of  magnitude  (or  amount )  i,  from  s  to  t  in  G  is  a 
real-valued  function  x  with  domain  E  that  satisfies  the 
linear  equations  and  inequalities 

v*  1  v  *  s, 

(ID  '  a (v ,  e)x(e)  ■  -a,  v  •  t, 

e-  E  (  0,  v  4  s .  t 
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(1.2)  -  c(e)  ^  x(e)  £  c(e),  e  e  E. 

Thus  |x(e)l  can  be  thought  of  as  the  magnitude  of  flow  in 
edge  e;  if  x(e)  >  0,  the  direction  of  flow  in  e  agrees  with 
the  orientation  of  e;  if  x(e)  <  0 ,  the  direction  of  flow 
is  against  the  orientation  of  e.  The  equations  (1-1) 
stipulate  that  a  units  of  flow  leave  s  and  enter  t,  flow 
being  conserved  at  all  other  vertices.  We  call  s  the  source, 
t  the  sink.  The  maximum  flow  problem  is  that  of  constructing 
an  x  that  satisfies  (1*1),  (1-2),  and  maximizes  a. 

We  can  get  rid  of  the  asymmetry  in  equations  (1.1)  by 
adding  a  special  edge  e'  to  G  joining  s  and  t,  say  e'  ~  (t,  s), 
which  returns  a  units  of  flow  to  s  from  t;  we  may  take  c(e') 
large.  In  other  words,  by  distinguishing  one  edge  e1  of 
a  graph,  the  maximum  flow  problem  may  be  viewed  as  that  of 
maximizing  x(e')  subject  to  (1.2)  and  the  conservation 
equations 

(1.1')  £a(v,  e)x(e)  -  0,  v  e  V. 

eeE 

For  the  moment,  we  shall  continue  to  work  with  (1*1)  and 

(1.2) ,  however. 

We  refer  to  the  graph  G  with  capacity  function  c  and 
distinguished  vertex  pair  s,  t  as  a  (two-terminal)  flow 
network,  or  briefly,  a  network.  In  general,  we  use  the 
word  network  in  this  paper  to  mean  a  graph  together  with 
one  or  more  real-valued  functions  defined  on  its  edges. 
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To  state  the  fundamental  theorem  about  maximum  network 
flow,  we  require  one  other  notion  about  graphs,  that  of  a 
cut.  A  cut  K  c  E  separating  s  and  t  in  a  graph  G  is  a 
subset  of  edges  that  has  some  edge  in  common  with  each  path 
joining  s  and  t  in  G.  We  say  that  K  blocks  all  such  paths. 
(Here  a  path  joining  s  and  t  is  a  sequence  of  distinct 
end-to-end  edges  that  starts  at  s  and  ends  at  t.  Edges  may 
be  traversed  with  or  against  their  orientations  in  going 
from  s  to  t  along  the  path.)  If  all  edges  of  K  are 
deleted  from  G,  the  vertices  s  and  t  fall  in  separate 
components  of  the  new  graph.  It  is  intuitively  clear  that 
a  in  (1.1)  is  bounded  above  by 

(1.3)  c (K)  -  £  c(e), 

eeK 

the  capacity  of  cut  K.  We  can  prove  this  from  (1*1)  and 
(1.2)  by  adding  those  equations  of  (1.1)  corresponding  to 
vertices  in  the  s-component  of  the  graph  G'  gotten  from  G 
by  deleting  edges  of  K.  The  result  is 

(1.4)  a  -  £,  x(e)  -  £_  x(e)  <;  c(K), 

eelT  eeK 

where  K+  (K~~ )  consists  of  those  edges  of  K  with  positive 
(negative)  end  in  the  s-component  of  G*  and  negative 
(positive)  end  outside  this  component.  In  words,  for  an 
arbitrary  flow  from  s  to  t  of  magnitude  a  and  an  arbitrary 
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cut  separating  s  and  t,  the  net  flow  across  the  cut  is  a, 
which  is  consequently  bounded  above  by  the  cut  capacity. 
Theorem  1.1  below  asserts  that  equality  holds  in  (1.4)  for 
some  flow  and  some  cut,  and  hence  the  flow  is  a  maximum 
flow,  the  cut  a  minimum  cut  [9]. 

Theorem  1.1.  For  any  network  the  maximum  amount  of 
flow  from  source  to  sink  is  equal  to  the  minimum  capacity  of 
all  cuts  separating  source  and  sink. 

Theorem  1.1,  the  max- flow  min -cut  theorem,  is  a 
combinatorial  version,  for  the  special  case  of  the  maximum 
flow  problem,  of  the  duality  theorem  for  linear  programs, 
and  can  be  deduced  from  it  [4].  Such  a  proof  makes  crucial 
use  of  the  fact  that  the  vertex-edge  incidence  matrix  of 
an  oriented  graph  G  is  totally  unimodular,  i.e.,  every  square 
submatrix  has  determinant  0  or  +  1.  A  simpler  proof  of 
Theorem  1.1  is  the  second  proof  given  by  Ford  and  Fulkerson 
[10].  This  proof  also  leads  to  an  efficient  algorithm  for 
constructing  a  maximum  flow. 

Proof  of  Theorem  1.1:  It  suffices  to  establish  the 
existence  of  a  flow  x  and  a  cut  K  for  which  equality  holds 
in  (1.4).  Let  x  be  a  maximum  flow,  of  amount  a,  from  s  to 
t.  Define  a  set  U  c  V  recursively  as  follows: 


(15a) 


s  e  U; 
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(1.5b)  if  u  e  U  and  e  ~  (u,  v)  is  an  edge  such 
that  x(e)  <  c(e),  then  v  €  U;  if  u  e  U 
and  e  ~  (v,  u)  is  an  edge  such  that 
x(e)  >  —  c(e)j  then  v  e  U. 

We  assert  that  t  €  U  ■  V  —  U.  For  suppost  not.  It  then 
follows  from  the  recursive  definition  of  U  that  there  is 
a  path  P  from  s  to  t  such  that  x(e)  <  c(e)  on  edges  e  e  P+ 

and  x(e)  >  -  c(e)  on  edges  e  e  P~ .  Here  P  ■  P+  U 

where  P+  consists  of  those  e  e  P  whose  orientations  agree 

with  the  orientation  of  P  from  s  to  t.  Let 

(1.6)  e  -  mintmin,  (c(e)  -  x(e)),  min_  (c(e)  +  x(e))]  >  0 

e€PT  eeP 


and  define 

/x(e),  e  4  P, 

(1*7)  x'(e)  “  |  x(e)  +  e,  e  €  P+, 

l  x(e)  -  e>  e  e  P~. 

Then  x'  is  a  feasible  flow  from  s  to  t  of  amount  a  +  e, 
contradicting  the  assumption  that  x  was  a  maximum  feasible 
flow.  Hence  t  e  U,  as  asserted.  Let  K  be  the  set  of 
edges  joining  U  and  U,  and  write  K  ■  K+  u  K*",  where  K+(K~) 
consists  of  those  edges  of  K  with  positive  (negative)  end 
in  U.  Then  K  is  a  cut  separating  s  and  t,  and  it  follows 
from  the  definition  of  U  that  x(e)  ■  c(e)  for  e  e  K+ 
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x(e)  “  -  c(e)  for  e  e  K  •  Hence  equality  holds  in  (1.4). 

Notice  that  the  proof  shows  that  a  flow  x  is  maximum 
if  and  only  if  there  is  no  x-augmenting  path  from  s  to  t 
(i.e.,  a  path  P  such  that  (1.7)  yields  a  feasible  flow  x'). 

If  we  assume  that  the  capacity  function  c  is  integral- 
(or  rational-)  valued,  the  proof  provides  a  good  algorithm 
for  constructing  a  maximum  flow.  We  can  begin  the 
computation  with  any  integral-valued  feasible  flow  from  s 
to  t,  e.g.,  x(e)  ■  0  all  e  t  E-  We  then  institute  a  fearch 
for  a  flow- augmenting  path  using  the  prescription  of  (1.5a) 
and  (1.5b).  A  good  way  to  apply  this  prescription  is  to 
fan  out  from  s  to  all  its  neighboring  vertices  that  can  be 
put  into  U  using  (1.5b);  then  repeat  the  process  by  selecting 
one  of  these  vertices,  scanning  it  for  all  its  neighbors 
not  yet  in  U  that  can  now  be  put  into  U,  and  so  on.  This 
way  of  searching  for  a  flow-augmenting  path  is  called  the 
"labeling  process"  in  [8],  where  it  is  described  in  terms 
of  assigning  labels  to  vertices  as  we  put  them  in  U;  in 
terms  of  (1.5b),  the  label  assigned  to  vertex  v  is  u. 

(This  simple  process  forms  the  basis  for  most  of  the 
network-programming  algorithms  described  in  [8].)  If  this 
search  is  successful  in  finding  t,  the  flow  increment  c 
of  (1.6)  is  a  positive  integer,  and  hence  x'  of  (1.7)  is 
again  an  integral-valued  flow.  If  unsuccessful,  the 
present  flow  is  a  maximum  flow,  and  a  minimum  cut  has  been 
located.  Thus  the  algorithm  terminates,  and  at  termination 
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we  have  constructed  an  integral  maximum  flow  and  a  minimum 
cut. 


Theorem  1.2.  If  the  capacity  function  c  is  integral- 
valued,  there  is  an  integral  maximum  flow. 

Theorem  1.2  is  important  in  combinatorial  applications 
of  network  flows. 

While  we  have  taken  the  capacity  constraints  (1.2)  to 
be  symmetric  about  the  origin,  there  is  no  real  need  for 
this  assumption.  The  constraints  (1*2)  can  be  changed  to 

(1-2')  b(e)  1  x(e)  £  c(e),  e  e  E, 

and  handled  in  an  analogous  fashion  provided  they  are 
feasible,  that  is,  the  constraint-set  (1*1),  (1 - 2 * )  is  nonempty. 
(Thus,  for  example,  "one-way  streets"  can  be  incorporated 
in  the  model*)  Even  the  feasibility  question  can  be  dealt 
with  by  an  appropriate  modification  of  the  argument  used  in 
the  proof  of  Theorem  1.1,  or  by  applying  a  version  of 
Theorem  1.1  to  an  enlarged  network.  For  a  detailed 
discussion  of  this  and  other  extensions,  e.g.,  capacities 
on  vertices  as  well  as  edges,  we  refer  to  [8].  Here  we 
shall  simply  state  a  typical  feasibility  theorem,  the 
circulation  theorem  due  to  Hoffman  [18]. 
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Theoretn  1.3.  Let  b(e)  c(e)  for  each  edge  e  of  a 
network  G  be  given  real  numbers.  The  constraints  (1.1') 
and  (1.2*)  are  feasible  in  G  if  and  only  if,  for  each 
subset  U  c  V,  we  have 

E  ,  c (e)  -  b(e)  ^  0, 

eeKT  eeK 

where  K+  (K  )  consists  of  those  edges  of  G  with  positive 
(negative)  end  in  U  and  negative  (positive)  end  in  V  -  U. 

Minty  [23]  has  distilled  from  the  above  proof  of  the 
max-flow  min-cut  theorem  and  from  other  network  algorithms 
of  Ford  and  Fulkerson  [10,  11]  a  theorem  about  graphs,  which 
Berge  and  Ghouila-Houri  [1]  have  called  "Lerrane  des  Arcs 
Colores."  We  call  it  the  painting  theorem.  To  state  it, 
we  require  some  definitions.  A  circuit  C  c  E  in  graph  G 
is  a  minimal  closed  path  in  G,  that  is,  a  set  of  edges 
which  forms  a  closed  path  and  is  minimal  with  respect  to 
this  property.  A  cocircuit  D  c  E  is  a  minimal  cut,  that 
is,  a  set  of  edges  whose  deletion  increases  the  number  of 
connected  components  of  G  and  is  minimal  with  respect  to 
this  property.  (In  terms  of  the  (0,  +  1) -vertex-edge 
incidence  matrix  of  an  orientation  of  G,  a  circuit 
corresponds  to  a  minimal  dependent  set  of  columns  of  the 
matrix,  where  "dependent"  means  "linearly  dependent  over 
the  rials."  If  G  is  unoriented,  and  the  vertex-edge  matrix 
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ls  taken  to  be  a  (0,  1)  -  matrix,  then  a  circuit  corresponds 
to  a  minimal  dependent  set  of  columns,  where  "dependent" 
means  "linearly  dependent  over  the  integers  mod  2.")  A 
painting  of  G  is  a  partition  of  the  edges  of  G  into  three 
sets  R,  W,  B,  and  the  distinguishing  of  one  edge  of  the 
set  R.  It  may  be  viewed  as  painting  the  edges  of  G  with 
three  colors — red,  white,  blue — with  one  red  edge  being 
distinguished  and  painted  dark  red. 

Theorem  1.4.  Given  a  painting  of  an  oriented  graph 
G,  precisely  one  of  the  following  alternatives  holds: 

(i)  There  is  £  In  G  CQnUining  ihs.  dark  x£d 

edge  but  no  white  edge,  in  which  all  red  edges  are 
similarly  oriented. 

(ii)  There  is  a  cocircuit  in  G  containing  the  dark 
red  edge  but  no  blue  edge .  in  which  all  red  edges  are 
similarly  oriented. 

Proof:  Let  e'  -  (t,  s)  be  the  dark  red  edge.  If 
e'  is  o  loop,  then  (i)  holds  and  (ii)  fails,  by  the 
minimality  of  a  cocircuit.  If  t  f  »,  define  a  subset 
U  c  V  recursively  by  the  rules 

(1.8a)  s  €  U; 

(1.8b)  if  u  c  U  and  e  ~  (u,  v)  is  red  or  blue,  then 
v  €  U;  if  u  €  U  and  e  —  (v,  u)  is  blue,  then 


V  €  U- 
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If  t  e  U,  there  Is  an  elementary  (minimal,  simple)  path 
from  s  to  t  of  red  and  blue  edges  in  which  all  red  edges 
are  oriented  in  the  path  direction.  This  path,  together 
with  edge  e',  provides  the  circuit  of  (i).  Conversely,  if 
(i)  holds,  then  t  e  U.  If  t  {  U,  consider  the  set  of  edges 
joining  U  to  U  •  V  -  U*  These  edges  are  either  white  or 
red,  and  any  red  edge  is  oriented  from  U  to  U,  as  e'  is. 
Delete  these  edges.  The  resulting  graph  has  components 
U,  U^,  .  •  • ,  with  t  f.  Ur  The  set  of  edges  joining  U 
and  is  the  cocircuit  of  (ii).  Conversely,  if  (ii)  holds, 
then  t  cannot  be  in  U  via  (1.8b). 

To  apply  the  painting  theorem  to  the  maximum  flow 
problem,  first  add  the  return- flow  edge  e'  ~  (t,  s)  to  the 
network  with  c(e')  large-  Let  x  satisfy  (1.1*),  (1.2). 

Paint  e'  dark  red.  For  other  edges  e:  If  c(e)  -  0,  paint 
e  white;  if  x(e)  •  c(e)  >  0,  paint  e  red  and  reorient  e; 
if  x(e)  *  -  c(e)  <  0,  paint  e  red;  if  -  c(e)  <  x(e)  <•  c(e), 
paint  e  blue.  Alternative  (i)  of  the  painting  theorem 
then  leads  to  a  flow-augmenting  path,  whereas  (ii)  leads 
to  a  minimum  cut.  In  this  application  the  white  edges 
play  a  pale  role — they  could  have  been  deleted  once  and 
for  all.  But  there  are  other  network-programming  problems 
for  which  labeling  algorithms  that  have  been  described 
(10,  11,  12,  23]  can  be  viewed  in  terms  of  edge  paintings; 
the  role  played  by  white  edges  is  less  passive  in  some  of 
these. 
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Before  leaving  the  discussion  of  maximum  network 
flow,  we  mention  an  alternative  formulation  of  the  problem. 
This  formulation  is  in  terns  of  the  path-edge  incidence 
matrix  of  an  unoriented  graph;  it  was  used  in  the  first 
proof  of  the  max-flow  min-cut  theorem  [91-  Let  P  be  the 
collection  of  all  paths  from  s  to  t  in  G.  For  each  P  c  'P 
and  e  e  E  define  an  integer  p(P,  e)  “  1  or  0  according  as 
e  c  P  or  e  ^  P.  We  call  the  resulting  matrix  the  path- 
edge  incidence  matrix  of  G-  Let  y  be  a  real-valued 
iunction  with  domain  P  that  satisfies 


d-9) 

Lj  y (P)p (Pj  e)  <  c(e) 

Pe** 

(1.10) 

y(P)  ^0,  P  €*>. 

Thus  y(P)  can  be  thought  of  as  the  magnitude  of  flow  in  P, 
and  (1.9)  says  that  the  total  amount  of  flow  in  e  cannot 
exceed  its  capacity.  Subject  to  (1.9),  (1.10),  we  wish  to 
maximize 

a. 11)  £  y(P). 

fcP 

This  version  of  the  problem  might  seem  to  be  more  restrictive, 
since  if  two  paths  P^  and  P2  contain  the  same  edge  e  in 
opposite  directions,  (1.9)  insists  that  we  add  y(P^)  and 
y(?2)  instead  of  "cancelling  flows  in  opposite  directions." 


The  two  formulations  are  equivalent,  however. 

If  the  capacity  function  c  is  integral  valued,  there 
is  an  integral-valued  y  satisfying  (1.9),  (1.10),  and 
maximizing  (1.11).  An  edge-form  of  Menger's  theorem  [20]  can 
be  deduced  from  this: 

Theorem  1.5.  Let  G  be  an  unoriented  graph  with  two 
distinguished  vertices  s  and  t.  The  maximum  number  of 
edge— disjoint  paths  joining  s  and  t  is  equal  to  the  minimum 
number  of  edges  in  a  cut  separating  s  and  t. 

2.  MINIMUM  PATH 

Let  1(e)  be  a  real  nonnegative  number  associated  with 
edge  e  of  an  unoriented,  connected  graph  G.  We  shall  think 
of  i(e)  as  the  length  of  edge  e.  The  length  of  path  P  is 

(2.1)  t(P)  -  £  1(e). 

eeP 

The  second  problem  concerning  two— terminal  networks  that 
we  consider  is  the  minimum  path  problem:  to  find  a  path 
joining  s  and  t  that  has  minimum  length.  There  are  several 
good  methods  known  for  doing  this.  We  describe  one  below, 
but  first  we  state  and  prove  a  theorem  that  is  a  path-cut 
dual  of  the  max-flow  min— cut  theorem.  Consider  the  maximum 
flow  problem  in  terms  of  the  path-edge  incidence  matrix. 
Suppose  now  that  we  form  the  cut-edge  incidence  matrix  by 
defining  d(K,  e)  -  1  or  0  according  as  e  e  K  or  e  {  K. 

Here  K  is  a  cut  separating  s  and  t.  Let  it  denote  the 
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class  of  such  cuts.  Analogously  to  (1.9),  (1.10),  let 
y  be  a  real-valued  function  with  domain  satisfying 

(2.1)  £  y(K)d(K,  e)  <  t(e),  e  e  E, 

ketf 

(2.2)  y(K)  >0,  K  e  X  ■ 

Again  we  wish  to  maximize 

(2-3)  S  y(K) 

Kelt 

subject  to  these  constraints. 

The  maximum  value  of  (2.3)  cannot  exceed  the  length 
of  a  minimum  path  from  s  to  t,  because  a  path  from  s  to 
t  has  some  edge  in  common  with  each  K  e  fC  . 

Theorem  2 ■ 1 ■  The  maximum  value  of  (2.3)  subject  to 
(2.1)  and  (2-2)  is  equal  to  the  minimum  path  length  from 
s  to  t. 

The  purely  combinatorial  version  of  (2.1)  -  (2.3)  in 
which  t(e)  *  1  all  e  e  E  and  y(K)  -  0  or  1  all  K  e  , 
asks  for  the  maximum  number  of  mutually  disjoint  cuts 
separating  s  and  t.  As  was  the  case  for  the  maximum  flow 
problem,  rf  l  is  integral  valued,  there  is  an  integral¬ 
valued  y  that  solves  the  linear  program  (2.1)  -  (2.3). 

This  will  fellow  from  the  proof  given  below.  Hence  the 
maximum  number  of  disjoint  cuts  separating  s  and  t  is  equal 
to  the  minimum  number  of  edges  in  a  path  joining  s  and  t. 
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Proof  of  Theorem  2.1.  Let  n (v)  be  the  minimum  path 
length  from  s  to  v,  for  all  v  e  V.  Thus  tt(v)  >  0  ar.d 
rr(s)  *  0.  Let  0  «  Hq  <  rr^  <  ...  <  tt^  be  the  distinct  values 
assumed  by  n.  Partition  V  into  n  +  1  parts  Vq,  V^,  ...,  Vn, 
where 


-  {v  e  V |tt  (v)  -  tt1}  . 

Thus  s  e  Vq.  Suppose  t  €  V^.  We  then  single  out  k  cuts 

K, ,  K0,  . . . ,  K.  In  Hi  by  letting  K.  be  the  set  of  edges 
12  Tl  j_i  J  j-i 

joining  vertices  of  U  V.  and  vertices  of  V  -  u  V., 

i-0  1  i-()  1 

j  -  1,  2,  ...,  k.  Define  y(Kj)  *  rr^  -  tt^,  j  *  1.  2,  .  ..,  k, 

and  y(K)  -  0  for  other  cuts  K  €  .  Then  y  solves  (2.x)  - 

(2.3).  To  prove  this,  it  suffices  to  show  that  y  satisfies 

(2.1),  since  clearly  y(K)  >  C  all  K  t  ,  and 

k 

£  y(K)  -  £  (TTj  -  TT »  TTk  -  TTq  -  TTk  -  TT(t)  . 

Ke  1C  j—l 

Thus  consider  an  edge  e  joining  a  vertex  u  cf  and  a 
vertex  v  of  Vj,  where  i  <  j  <  k,  so  that  e  belongs  to  each  of 
the  cuts  Kj^,  •..,  K j ,  but  to  no  other  cut  having  positive 
weight  in  y.  Suppose  that 

y(Ki+i)  +  •  •  •  +  y(Kj)  -  nj  -  tt±  >  1(e). 

There  is  a  path  from  s  to  u  of  length  tt^;  adjoining  e  to 
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this  path  yields  a  path  from  s  to  v  of  length  rr^  +  4(e)  < 
ni  +  (nj  ~  ni^  “  17 j  ’  a  contrac*iction.  If  j  >  k,  a  similar 
contradiction  results.  Hence  y  satisfies  (2.1)  and  solves 
(2.1)  -  (2.3). 

For  the  case  of  a  planar  two-terminal  network  (that  is, 
the  graph  G  together  with  the  additional  edge  e'  joining 
the  terminals  s  and  t  is  a  planar  graph),  where  one  can 
construct  a  dual  two-terminal  network  in  which  source-sink 
paths  correspond  to  cuts  separating  s  and  t  in  the  primal 
network,  the  duality  between  the  maximum  flow  problem  and 
the  minimum  path  problem  was  noted  in  [9],  and  was  exploited 
in  developing  a  max-flow  algorithm  for  such  networks,  the 
"top-most  path"  method  of  [9].  Theorem  2.1  for  arbitrary 
two-terminal  networks  is  due  to  Robacker  [27].  From  the 
point  of  view  of  Part  II  of  this  paper.  Theorem  2.1  and 
the  max-flow  mirr-cut  theorem  are  abstractly  the  same. 

We  return  now  to  the  problem  of  constructing  a  minimum 
path  joining  s  and  t.  The  procedure  we  sketch  here  is  a 
special  case  of  a  more  general  algorithm  for  constructing 
minimum  cost  flows  in  networks  [11].  It  evaluates  the 
minimum  path  length  tt(v)  from  s  to  v  for  all  v  €  V,  and 
hence  provides  a  solution  y  to  (2.1)  -  (2.3).  We  may 
suppose  in  the  description  that  there  are  no  loops  or  multiple 
edges  in  G.  If  edge  e  has  ends  u,  v,  we  write  the  unordered 
pair  (u,  v)  for  e  and  4(u,  v)  for  4(e). 

To  start  out,  take  n(s)  -  0.  Next  look  at  all  edges 
(s,  v)  and  find  the  minimum  value  of  4(s,  v)  for  such  edges. 

If  v  is  a  vertex  yielding  this  minimum,  set  n(v)  •  4(s,  v) . 
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The  general  step  of  the  computation  is  as  follows.  Suppose 
that  tt(u)  has  been  defined  for  u  e  U  c  V.  Let  U  -  V  -  U  and 
compute 

(2.4)  min  Itt(u)  +  t(u,  v)J  -  6. 
ueUjVeU 

If  the  minimum  in  (2.4)  is  achieved  for  an  edge  (u,  v)> 
set  tt(v)  -  6.  Repeat  the  general  step  until  n(v)  has  been 
defined  for  all  v  e  V.  The  number  tt(v)  defined  in  this  way 
is  the  minimum  path  length  from  s  to  v.  A  convenient  way 
to  do  the  calculation  is  to  assign  to  vertex  v  the  label 
(u,  tt(v)),  where  u  is  some  vertex  for  which  the  minimum 
in  (2.4)  is  achieved.  A  minimum  path  from  s  to  v  can  then 
be  found  by  backtracking  from  v  to  s  as  directed  by  first 
members  of  the  labels. 

At  the  conclusion  of  the  computation,  the  numbers  tt(v) 
satisfy  the  inequalities 

(2.5)  -t(u,  v)  <  tt(v)  -  tt(u)  <  t(u,  v) 

for  all  edges  (u,  v)  of  G,  and  maximize  n(t)  -  tt(s)  subject 
to  (2.5).  If  we  interpret  t(u,  v)  as  the  cost  of  transporting 
a  unit  of  some  commodity  over  edge  (u,  v),  the  number  tt(v) 
can  be  given  the  economic  interpretation  of  a  price  placed 
on  a  unit  of  the  conmodlty  at  location  v.  Inequalities 
(2.5)  then  say  that  no  profit  can  be  made  by  purchasing  a 


-19- 


unit  of  the  commodity  at  u  and  transporting  it  to  v  or  vice 
versa.  Subject  to  these  restrictions,  the  price  difference 
n(t)  -  tt(s)  is  to  be  maximized.  Thus  the  maximum  value  of 
rr(t)  -  rr(s)  subject  to  (2.5)  is  equal  to  the  minimum  path 
cost  from  s  to  t.  In  another  interpretation,  Duffin  has 
called  this  result  the  "max-potential  equals  min— work" 
theorem  [5]. 

The  assumption  that  edge  lengths  are  nonnegative  has 
been  used  in  an  essential  way  in  this  section  .  If  edge 
lengths  are  allowed  to  be  negative,  and  if  we  ask  for  a 
minimum  length  simple  path  joining  two  vertices,  the  problem 
is  much  harder.  There  are  no  known  good  algorithms  for 
constructing  such  a  path. 

3.  MAXIMUM  CAPACITY  PATH 

Again  we  consider  a  two-terminal  unoriented  network 
G  with  source  s,  sink  t,  and  capacity  function  c.  This 
time  we  wish  to  find  a  path  P  from  s  to  t  that  has  the  largest 
flow  capacity  of  all  such  paths,  i.e.,  we  want  to  find  a  P 
that  achieves 

(3- 1)  max  min  c(e), 

Pef^etP 

where  P  is  the  class  of  all  paths  joining  s  and  t.  We 
call  this  the  maximum  capacity  path  problem. 

This  bottleneck  problem  has  been  considered  in  [13,  19, 
26].  It  is  related  to  the  minimum  path  problem  in  the  sense 
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that  methods  for  solving  the  latter  can  be  modified  to 
solve  it.  But  here  we  shall  describe  another  easy  way  of 
solving  the  problem,  one  that  extends  to  blocking  systems 
(Part  III).  This  method  of  solution  might  be  termed  the 
"threshold  method."  It  leads  to  the  following  min-max 
theorem  concerning  paths  and  cuts  [13]. 

Theorem  3.1.  Let  G  be  a  network  with  capacity  function 
c  and  terminals  s  and  t.  Then 

(3.2)  max  min  c(e)  *  min  max  c(e), 

P ei>  eeP  Ke*  eeK 

where  -f*  is  the  class  of  all  paths  joining  s  and  t  and 
X  is  the  class  of  all  cuts  separating  s  and  t. 

Proof.  If  P  €  "P  and  K  e  V  ,  then  P  fl  K  is  nonempty. 
Let  e'  e  P  D  K.  Then 

min  c(e)  <  c(e’)  <  max  c(e). 
eeP  ~  eeK 


It  follows  that 

(3.3)  max  min  c(e)  <  min  max  c(e). 

Pe?  eeP  Ketf  eeK 

To  establish  equality  in  (3.3),  we  can  proceed  as 
follows-  Let  ci  >  c2  >  •••  >  cn  be  the  distinct  values 
assumed  by  the  capacity  function,  and  let  Cq  be  large. 
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Let  be  the  network  obtained  from  G  by  deleting  all  edges 

e  satisfying  c(e)  <  c^,  i  -  0,  1,  . ..,  n.  Thus  Gq  has  no 

edges,  and  Gn  *  G.  Suppose  Gj  is  the  first  G^  that  contains 

a  path  joining  s  and  t.  (We  are  tacitly  assuming  that  P 

is  nonempty,  although  an  appropriate  interpretation  of 

(3.2)  holds  if  this  isn't  so.)  Since  Gj  has  a  path  P  e  'P 

and  G,i  contains  no  path  in  P  ,  we  have  min  c(e)  »  c,. 

J  1  eeP  J 

On  the  other  hand,  the  edges  deleted  from  G  in  forming 

Gj_i  contain  a  cut  K  c  ,  whereas  the  edges  deleted  from 

G  in  forming  G,  contain  no  cut  in  1C,  and  thus  max  c(e)  ■  c,. 

J  eeK  J 

Consequently  equality  holds  in  (3.3). 

Thus  to  solve  the  maximum  capacity  path  problem,  we 

lower  the  threshold  for  edge  capacities  until  a  path 

joining  s  and  t  is  produced.  There  are  good  algorithms 

for  recognizing  when  this  happens. 

Notice  that  no  use  is  made  of  the  fact  that  c(e)  >  0. 

Indeed  the  solution  depends  only  on  the  ordering  of  the 

edge  numbers  c(e),  not  on  their  magnitudes. 

An  appropriate  version  of  the  threshold  method  can 

be  used  to  locate  a  flow-augmenting  path  that  yields  the 

largest  flow  increment  (1.6).  Thus  one  way  to  solve  the 

maximum  flow  problem  is  to  successively  find  maximum  capacity 

flow-augmenting  paths  by  a  threshold  method. 


One  can  also  show 
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(3.4)  min  max  c(e)  -  max  min  c(e). 

PcT °  eeP  KeK  eeK 

For  an  interpretation,  think  of  G  as  a  highway  map  with 
c(e)  being  the  maximum  elevation  encountered  in  driving 
over  edge  e. 

4.  LENGTH-WIDTH  INEQUALITY. 

Duffin  (5]  has  defined  the  notions  of  "extremal  length" 
and  "extremal  width"  for  two-terminal  networks  having  edge 
resistances  and  has  shown  that  these  are  reciprocal  quantities. 
From  this  relationship  he  deduced  a  certain  inequality  con¬ 
cerning  paths  and  cuts  for  a  two-terminal  network  in  which 
each  edge  has  associated  with  it  two  nonnegative  numbers 
t(e)  and  w(e),  the  length  and  width  of  e.  An  earlier,  purely 
combinatorial  version  of  this  inequality  in  which  1(e)  - 
w(e)  -  1  is  due  to  Moore  and  Shannon  (25].  This  version 
says  that  if  X  is  the  least  number  of  edges  in  a  path 
joining  8  and  t  and  w  is  the  least  number  of  edges  in  a 
cut  separating  s  and  t,  then  Xju  is  less  than  or  equal  to 
the  number  of  edges  in  the  graph.  More  generally,  let 

(4.1)  X  »  min  l(P)  •  min  Z/  t(e), 

PtP  PtP  ecP 

(4.2)  •  min  w(K)  •  min  Z j  w(e), 

Kc  H  KcX  eeK 

where  'P  is  the  class  of  all  paths  joining  s  and  t,  Is 

the  class  of  all  cuts  separating  s  and  t.  The  number  x 
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is  called  the  length  of  G,  uu  the  width  of  G,  relative  to 
s  and  t.  The  length-width  inequality  asserts  that 

(4.3)  X-ju  <  Z /  l  (e)w(e) . 

eeE 

A  proof  of  (4.3)  can  be  given  usin6  either  the  max- 
flow  min-cut  theorem  or  its  path-cut  dual.  We  use  the 
former  approach.  Interpret  w(e)  as  the  flow-capacity  of 
e.  Then  by  the  max-flow  min-cut  equality,  there  is  a  flow 
from  s  to  t  of  magnitude  uu.  It  follows  that  there  is  a 
function  y  defined  on  satisfying  (1.9),  (1.10),  and 


£  y(P) 
P€^ 


UU. 


Thus 


Xw  -  \  £  y(P)  <  £  t(P)y(P)  -  £  £  t (e)y(P) 

Pc'?  P tP  ecP 

<  £  t(e)  £  y(P)p(P,  e)  <  £  t(e)w(e). 

ecE  Pc  ecE 

Although  the  length-width  inequality  appears  weak, 
we  shall  point  out  in  Part  III  that  the  existence  of  a 
length-width  inequality  for  a  blocking  system  implies  the 
max-flow  min-cut  equality  for  the  system. 
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PART  II  -  FRAMES 

Our  aim  in  this  part  of  the  paper  is  to  indicate  how 
the  theorems  of  Part  I  can  be  generalized  to  frames  of 
subspaces  of  Euclidean  n-space.  (We  shall  define  a  frame 
later  on.  But  it  should  be  mentioned  here  that  the  word 
"frame'1  was  used  by  Tutte  in  some  of  his  early  work  on 
chain— groups  and  matroids  in  place  of  the  word  "matroid". 

We  appropriate  it,  with  his  permission,  for  a  more 
restrictive  use.)  The  notion  of  a  frame  is  closely  related 
to  that  of  a  metric  matroid.  Inceed  a  frame  can  be  viewed 
as  the  structure  obtained  just  prior  to  the  matroid  in 
making  the  transition  from  matrix  to  its  matroid. 

Matroids  were  introduced  by  Whitney  [ 35 J  as  a  gener¬ 
alization  of  dependence  properties  in  graphs  or  in  matrices. 
There  is  now  an  extensive  and  deep  theory  of  matroids, 
mostly  due  to  Tutte  [30,  31,  32.  33,  34] .  We  require  only 
the  more  elementary  parts  of  this  theory.  (Certainly  Tutte's 
Introduction  to  the  Theory  of  Matroids  [34]  would  suffice) 

The  generalization  from  Part  I  to  Part  II  can  be  des¬ 
cribed  roughly  as  that  obtained  by  replacing  the  vertex- 
edge  incidence  matrix  of  an  oriented  graph  by  an  arbitrary 
real  matrix.  (More  generally,  we  cculd  consider  matrices 
over  any  ordered  field.)  Thus  we  pass  from  the  special 
network  programs  of  Part  I  to  general  linear  programs. 

Associated  with  every  linear  program  there  is  a  dual 
program.  Associated  with  every  matroid  there  is  a  dual 
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matroid  Associated  with  every  frame  there  is  a  dual  frame. 
Frame  duality  provides  a  bridge  between  matroid  duality  and 
linear  programming  duality.  The  basic  concept  underlying 
duality  in  all  tnree  instances  is  orthogonality. 

Although  the  material  of  this  part  of  the  paper  was 
developed  independently  by  "he  writer,  we  doubt  that  much 
of  it  is  new.  A  recent  paper  by  Rockafellar  [28]  contains 
a  similar  development,  for  example.  Our  attention  has  also 
been  called  to  work  of  Camion  [2],  and  to  a  forthcoming 
book  on  networks  by  Iri.  Most  of  the  notions  and  some 
of  the  results  are  either  explicit  or  implicit  in  Tutte's 
work  on  matrcids.  We  believe  that  our  treatment  of  the 
generalized  maximum  flow  problem  and  the  resulting  length- 
width  inequality  for  real  matrices  may  be  new,  however. 

1.  FRAMES  OF  REAL  SUBSj ACES 

Let  ft  be  an  arbitrary  subspace  of  n— dimensional 
Euclidean  space  *0n.  For  the  correspondence  with  Part  I, 
a  vector  X  *  (x^,  X2,  ...,  xn)  in  7?n  should  be  thought  of 
as  a  real-valued  function  on  a  finite  set  of  "edges" 

F  ■  tep  e2,  •••,  en]  that  maps  e^  into  Xp  and  ft  should 
be  viewed  as  the  row  space  of  an  m  by  n  real  matrix 
A  *  (a^j),  t*ie  "generalized  vertex— edge  incidence  matrix". 

Let  Y  ■  (y^,  y 2>  •••,  yn)  be  a  vector  of  ft.  The 
support  S(Y)  of  Y  consists  of  those  ei  e  E  such  that  y^  +  0. 
A  vector  Y  €  ft  is  called  an  elementary  vector  of  ft  if  it 
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is  nonzero  and  if  there  is  no  nonzero  vector  X  e  18  such 
that  S(X)  is  a  proper  subset  of  S(Y).  Thus  if  X  and  Y 
are  two  elementary  vectors  of  7 G  having  the  same  support, 
then  X  is  a  nonzero  multiple  of  Y.  Consequently  we  may 
associate  with  %  a  unique,  finite  set  of  lines,  each  line 
being  determined  by  an  elementary  vector  of  18.  We  call 
this  collection  of  lines  the  frame  ■  >(*C)  of  it, 
and  sometimes  refer  to  an  elementary  vector  F  of  f  as  a 
frame-vector  of  H. . 

Let  X  and  Y  be  vectors  of  it .  The  vector  X  conforms 
to  Y  if  >  0  whenever  x^  ^  0.  In  particular,  S(X)  c  S(Y). 

Lemma  1.1.  Let  Y  be  a  nonzero  vector  of  jp.  There 
exists  an  elementary  vector  F  of  t  that  conforms  to  Y. 

Proof;  If  not,  select  Y  •  (y^,  ...,  y^)  e  it  so 

that  no  elementary  vector  of  7t  conforms  to  Y,  and  so  that 
the  number  of  elemencs  in  S(Y)  is  as  small  as  possible 
consistent  with  this  condition.  Let  X  -  (x^,  *2>  • • • >  xn) 
be  an  elementary  vector  of  H  such  that  S(X)  c  S(Y).  Let 
I  c  E  denote  the  set  of  e^  e  E  such  that  y^  and  x^  have 
opposite  signs.  Thus  I  is  nonempty.  Consider  the  vector 
Z  ■  Y  +  aX,  where 


The  vector  Z  conforms  to  Y  and  S(Z)  is  properly  included 
in  S(Y) .  By  the  selection  of  Y,  there  is  an  elementary 
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vector  F  conforming  to  Z.  But  then  F  conforms  to  Y.  This 
contradiction  establishes  the  lemma. 

An  important  consequence  of  Lenina  1.1  is  that  any  non¬ 
zero  vector  Y  e  It  can  be  written  as  a  sum 

(1.1)  Y  -  Fj_  +  F2  +  ...  +  Fk 

of  elementary  vectors  of  79,  where  each  elementary  vector 
F^  in  (1.1)  conforms  to  Y,  and  two  elementary  vectors 
Ft ,  Fj  with  i  +  j  lie  on  distinct  frame— lines  of  7t .  We 
call  (1.1)  a  conformal  frame  decomposition  of  Y.  In  general , 
such  a  decomposition  is  far  from  unique ,  of  course. 

We  return  now  to  the  matrix  A  -  (a^)  whose  rows 
generate  7Z  .  A  (column)  pivot  on  an  element  a^  ^  0  of 
A  is  a  sequence  of  elementary  row  operations  on  A  that 
transforms  A  into  a  matrix  A'  *  (a|j)  *n  w^ich  a^  ■  1, 
a^  ■  0  for  i  +  k.  Starting  with  A,  we  can  produce  from 
it  by  a  sequence  of  column  pivots  and  deletions  of  zero 
rows  a  matrix  R  whose  columns  can  be  permuted  to  have  the 
form 

(1.2)  (I,  B). 

If  A  has  rank  r,  then  R  is  r  by  n,  the  rows  of  R  are  a 
basis  for  79,  and  R  contains  an  r  by  r  permutation  sub¬ 
matrix  whose  columns  correspond  to  some  S  c  E.  Following 
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Tutte,  we  refer  to  such  a  matrix  R  as  a  standard  representa¬ 
tive  matrix  of  iC-  Note  that  each  row  of  R  is  an  ele¬ 
mentary  vector  of  ft.  The  following  theorem  asserts  that, 
conversely,  any  elementary  vector  of  it  can  be  obtained 
from  A  by  a  finite  sequence  of  pivots. 

Theorem  1.2.  Let  F  be  an  elementary  vector  of  ft. 

Then  there  exists  a  standard  representative  matrix  R  of  X 
having  a  multiple  of  F  as  one  of  its  rows. 

Proof.  Extend  F  to  a  basis  X  of  it,  and  write  the 
resulting  collection  of  vectors  as  a  matrix  having  F  as 
its  first  row,  say.  Pivot  on  a  nonzero  coordinate  of  F. 
Consider  the  second  row  of  the  transformed  matrix.  This 
row  has  a  nonzero  coordinate  in  one  of  the  columns 
corresponding  to  zero  coordinates  of  the  first  row,  for 
otherwise  either  F  would  not  be  elementary  or  #  would 
not  be  a  basis.  Pivot  on  such  an  element.  Repetition 
of  this  process  produces  a  standard  representative  matrix 
R  of  X  having  a  multiple  of  F  as  its  first  row. 

In  particular,  an  elementary  vector  of  It  can  have 
at  most  n  —  r  +  1  nonzero  coordinates. 

Notice  also  that  if  K  and  J  are  subspaces  having 
the  same  frame  P(X)  ■  then  X  »  A  . 

2 .  MATROIDS 

A  matroid  is  a  purely  combinatorial  structure  defined 
on  a  finite  set  E.  There  are  a  number  of  equivalent  axiom 
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systems  for  matroids.  One  in  terms  of  "circuits”  is  as 
follows.  Let  £  be  a  finite  family  of  nonempty  subsets  of 
E.  Members  of  €  are  the  circuits  of  a  matroid  (E,  £  ) 
if  the  following  axioms  hold: 

(2.1)  No  member  of  C  is  a  proper  subset  of  another. 

(2.2)  Let  e^  and  e2  be  distinct  members  of  E,  and  suppose 
and  C2  are  members  of  £  such  that  e^  e  n  C2  and 

e2  €  ^1  —  ^2*  T*,en  there  exists  e  £  such  that 
®2  ®  ^3  ^  ^  ^9)  1®^}* 

The  motivation  comes  from  graphs.  Let  E  be  the  set 
of  edges  of  an  unoriented  graph  G.  Then  the  collection  C 
of  (graph)  circuits  of  G  satisfies  (2.1),  (2.2),  and  thus 
(E,  £ )  is  a  matroid.  Such  a  matroid  is  graphic.  The 
collection  Jt  of  cocircuits  of  G  also  satisfies  (2.1), 

(2.2) ,  and  thus  forms  a  matroid  (E,  Such  a  matroid  is 

cographic.  For  another  important  example,  consider  the 
row  space  It  of  the  m  by  n  matrix  A.  Take  E  *  {e^,  e2> 

...,  enl-  Then  the  collection  £  of  supports  of  frame- 
vectors  of  £  satisfies  (2.1),  (2.2)  and  is  consequently  a 
matroid  (E,£).  Such  a  matroid  is  called  a  real  metric 
matroid. 

Associated  with  every  matroid  (E, £ )  there  is  a  unique 
dual  matroid  (E,  £*).  A  subset  of  E  is  a  member  of  €* 
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if  and  only  if  the  cardinality  of  its  intersection  with 

every  element  of  <S  is  not  equal  to  1,  and  it  is  minimal  with 

respect  to  this  property.  The  dual  of  the  dual  is  the 

primal:  (E,<J**)  *  (E,  <?).  In  case  (E,  6)  is  a  graphic 

matroid,  the  cographic  matroid  (E ,J&)  is  the  dual: 

(E  ,0)  -  (E,  (&*) ,  (E  ,J3*)  -  (E,  d )  ■  If  (E  ,&)  is  a 

real  matric  matroid  arising  from  a  subspace  It,  the  dual 

matroid  is  the  real  matric  matroid  obtained  from  the 

orthogonal  complement  it*  of  it.  Thus  if  >is  the  frame 

of  it,  we  call  the  frame  3*  of  it*  the  dual  of  3- 

If  #  has  standard  representative  matrix  R  ■  (Ir,  B), 

then  a  standard  representative  matrix  for  It*  is  R*  ■ 

T 

(B  ,  -  In_r)*  A  frame— vector  of  <  can  be  viewed  as 
representing  the  coefficients  of  a  minimal  linear  dependency 
among  columns  of  R*. 

Let  A  -  (a^ j )  be  the  vertex-edge  incidence  matrix  of 
an  oriented  graph  G.  It  is  well-known  that  the  matrix  A 
has  the  total  unimodularity  property:  every  square  sub¬ 
matrix  of  A  has  determinant  0,  1,  or  —1.  One  can  deduce 
from  this  that  each  elementary  vector  of  the  row  space  It 
of  A  is  a  multiple  of  a  vector  having  coordinates  0,  1, 
or  —I.  Such  a  vector  is  called  primitive.  Conversely,  if 
a  subspace  #  has  the  property  that  each  elementary  vector 
of  if  is  a  multiple  of  a  primitive  vector,  then  It  is  the 
row  space  of  some  totally  unimodular  matrix  A  *  (a^j). 
particular,  a^  •  0,  1,  or  -1.  Such  a  space  H  is  called 


regular  and  Che  corresponding  matroid  is  a  regular  matrold . 
Thus  regular  matroids  are  precisely  chose  real  maCric 
matroids  generaCed  by  CoCally  unimodular  matrices.  The 
dual  of  a  regular  matroid  is  regular.  A  dual  pair  of 
regular  maCroids  is  called  a  "digraphoid"  in  [24]. 

(1C  should  be  remarked,  Chough  we  make  no  use  of  it 
here,  Chat  Tutte  has  shown  that  a  regular  matroid  is  a 
binary  matric  matroid,  that  is,  a  matroid  generated  by  a 
matrix  over  Che  field  of  two  elements,  and  has  characterized 
regular  matroids  as  a  subset  of  the  binary  matric  matroids. 
This  characterization,  which  is  in  terms  of  certain  excluded 
matroid  minors — a  matroid  minor  is  not  the  same  thing  as 
a  matrix  minor — is  deeper  than  the  one  above,  also  due  to 
Tutte,  of  regular  matroids  as  a  subset  of  real  matric 
matroids.  It  can  also  be  shown,  as  was  pointed  out  to  the 
writer  by  Edmonds,  that  a  matroid  is  regular  if  and  only  if 
it  is  both  a  real  matric  matroid  and  a  binary  matric  matroid. 
From  this  one  can  deduce  that  a  (0,  +  1)— matrix  (I,  B)  is 
totally  unimodular  if  and  only  if  the  binary  rank  of  any 
subset  S  of  its  columns  is  equal  to  the  real  rank  of  S. 

This  can  also  be  proved  directly.  It  is  also  possible  to 
give  a  characterization  of  regular  matroids  among  those 
real  matric  matroids  generated  by  (0,  +  l)-matrices  in 
terms  of  a  single  excluded  matroid  minor:  namely,  exclude 
the  self-dual  matroid  on  a  set  of  four  elements,  every 
triple  of  which  is  a  circuit.  The  problem  of  characterizing 
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regular  matroids  among  all  real  matric  matroids  In  terms 
of  excluded  matroid  minors  appears  to  be  open,  as  does  the 
more  fundamental  problem  of  giving  necessary  and  sufficient 
conditions  in  order  that  two  real  matrices  generate  the 
same  matroid.) 

The  real  matric  matroid  generated  by  the  vertex-edge 
incidence  matrix  A  of  an  oriented  graph  is  a  regular  matroid. 
The  nonzero  coordinates  of  an  elementary  vector  F  of  the 
row  space  of  A  pick  out  a  cocircuit  in  the  graph,  two 
edges  being  similarly  oriented  in  this  cocircuit  if  the 
corresponding  coordinates  of  F  have  the  same  sign.  Con¬ 
versely,  each  cocircuit  of  the  graph  can  be  exhibited  in  this 
way  as  an  elementary  vector  of  X.  On  the  other  hand,  non¬ 
zero  coordinates  of  an  elementary  vector  of  it*  pick  out  a 
circuit  in  the  graph,  two  edges  being  similarly  oriented 
in  this  circuit  if  the  corresponding  coordinates  have  the 
same  sign,  and  each  circuit  of  the  graph  can  be  exhibited 
in  this  way. 

3.  GENERALIZED  FLOWS  AND  CUTS 

Let  A  *  (.a^j )  be  an  m  by  n  real  matrix  having  row 
space  it.  For  each  e^  e  E  -  {e^,  *2>  •••*  en),  let  c^ 
be  a  nonnegative  real  number,  the  capacity  of  e^ .  In 
analogy  with  ( 1 . 1  * )  and  (1.2)  of  Part  I,  we  define  a 
(feasible)  flow  X  on  A  to  be  a  vector  X  •  (x^,  X2»  • • • » 
xn)  that  satisfies  the  linear  homogeneous  equations 
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n 

(3*1)  a^x^  **  ^  ^  *  2,  *  •  •  *  ®» 

j-1 

and  inequalities 

(3.2)  -Cj  <  Xj  <  c^,  j  -  1,  2,  .  ..,  n. 

Thus  X  €  If*.  Clearly  feasible  flows  exist,  e.g.  X  -  0. 

The  analogue  of  the  maximum  flow  problem  is  to  find  a 
feasible  flow  X  on  A  that  maximizes  some  specified  component 
of  X,  say  Xp  where  c^  ■  •  .  We  call  such  a  flow  a  maximum 
e^-flow. 

Let  K  -  (-1,  V-2>  •••»  hn)  be  an  elementary  vector  of 
£.  (Such  elementary  vectors  exist  unless  the  first  column 
of  A  consists  entirely  of  0's — this  corresponds  to  the 
graphic  case  in  which  is  a  loop.)  We  say  that  K  is  an 
e^-cut.  There  are  finitely  many  such.  The  capacity  of  an 
ej-cut  K  is  defined  to  be 

non 

(3-3)  kjcj  “  S  kjcj 

-2  J-2 

^>0  kj<0 

If  X  is  a  feasible  flow  and  K  an  e^-cut,  then,  since 
X  €  H*  and  K  €  It,  we  have 

n 

r  *jkj  -  °< 

J-* 


£  lkjcji- 
J-2 
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and  hence,  by  (3.2), 

n  n 

(3.4)  X1  "  E  xjkj  i  E  lkjcjl- 

j-2  j-2 

Theorem  3.1.  The  maximum  value  of  subject  to 
(3.1)  and  (3.2)  j^s  equal  to  the  minimum  capacity  of  all 
e^— cuts . 

Proof.  It  suffices  to  show  that  there  is  a  flow  and 
an  e^— cut  for  which  equality  holds  in  (3.4).  A  proof  of 
this  can  be  given  using  either  the  linear  programming 
duality  theorem  [3,  16]  or  Dantzig's  simplex  method  for 
solving  linear  programs  [3].  We  sketch  the  former  approach. 
Let  X  «  (x^,  *2>  •••»  xn)  be  a  maximum  e^-flow.  The  duality 
theorem  for  the  linear  program  at  hand  then  implies  that  there 
exists  an  m-vector  (n^f  r^,  ....  n  )  such  that  the  following 
'’optimality*'  properties  hold: 

m 

(3.5)  1  +  £  ni*il  “  °' 

i»l 

and,  for  J  •  2 ,  . . . ,  n, 

m 

(3.6)  E  Vij  >  0  -  Xj  -  C  . 

i-1 

m 

E  vij  < 0  -  xj  -  -  v 

i-1 


Thus  Y  e  It  .  By  Lemma  1.1,  there  exists  an  elementary 
vector  K  *  (—1,  lt2»  •••>  k^)  of  It  that  conforms  to  Y. 

The  properties  (3.6)  then  hold  for  K  and  imply  that  equality 
holds  in  (3.4).  This  proves  Theorem  3.1. 

The  simplex  method  constructs  a  maximum  e^— flow  and 
a  minimum  e^— cut  simultaneously.  Indeed,  the  method 
proceeds  by  a  sequence  of  pivots  on  A.  and  at  termination 
yields  a  standard  representative  matrix  R  of  it,  one  of 
whose  rows  is  an  e^— cut  of  minimum  capacity. 

If  A  is  totally  unimodular,  then  the  coordinates  of 
K  in  Theorem  3.1  are  0,  1,  or  — 1 ,  and  we  have  a  more  purely 
combinatorial  result:  namely,  the  generalization  of  the 
max— flow  rain-cut  theorem  to  regular  matroids  or  digraphoids 
noted  in  (24 J.  Observe  that  the  analogue  of  the  integrity 
theorem.  Theorem  1-2  of  Part  I,  is  valid  for  this  case. 

Just  as  for  the  case  of  flows  in  networks,  the  assump¬ 
tion  of  symmetric  capacity  constraints  can  easily  be  dis¬ 
pensed  with  in  Theorem  3.1.  The  capacity  constraints  can 
be  changed  to  b^  <  x^  <  c^ ,  and  treated  in  a  similar  fashion, 
provided  they  are  feasible.  The  capacity  of  an  e^— cut 
K  ■  (-1.  k2 ,  • kn)  is  then  defined  to  be 
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(3  3)  £  kjcj  +  L  kjbj- 

J-2  j-2 

kjX>  k,<0 

The  feasibility  question  is  most  conveniently  disposed 
of  by  the  following  "generalized  circulation  theorem,"  the 
analogue  of  Theorem  1.3,  Part  I. 

Theorem  3.2.  Let  A  *  (a^ )  be  an  m  by  n  real  matrix , 
a°d  let  bj  <  c^  ,  j  "  2,  ...,  n,  be  given  real  numbers  ■ 

The  constraints 

n 

(3-7)  2aijxj  "  0,  i  -  1,  2,  ...,  m, 

(3-8)  bj  <  xj  £  cj>  j  -  1,  2,  ...,  n, 

ere  feasible  if  and  only  if,  for  each  elementary  vector 

K  •  (k^,  kj,  .  •  •  ,  kn)  iii  the  row  space  f  of  A,  we  have 


(3-9) 


kj>0 


kjCj 


kj<0 


kJbJ  * 


Notice  that  (3-9)  is  really  a  finite  set  of  inequalities, 
since  ue  need  only  choose  from  each  frame-line  of  *  one 
elementary  vector  and  its  negative  in  checking  (3.9). 

We  turn  next  to  the  painting  theorem  for  a  real  m  by  n 
matrix  A  -  (a^).  Here  we  paint  the  edges  of  E  -  fcj,  e2 , 

....  en),  i.e.  the  columns  of  A,  with  three  colors — red, 
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f 

k. 


white,  blue- — with  one  red  edge  being  distinguished  and 
painted  dark  red.  Two  edges  e^,  e^  are  similarly  oriented 
in  an  elementary  vector  X  ■  (x^,  X£>  •  xn)  of  a  sub¬ 
space  tic  >ln  if  XjXj  >  0;  X  contains  e^  if  e  S(X). 

Theorem  3.3.  Given  a  painting  of  E  ■  {e^,  e2>  • • • ,  e^} 
and  a  real  m  by  n  matrix  A  *  (a^j  )  having  row  space  ft , 
precisely  one  of  the  following  alternatives  holds : 

(i)  There  is  an  elementary  vector  X  of  it*  containing 
the  dark  red  edge  but  no  white  edge ,  in  which  all  red  edges 
are  similar ly  oriented . 

(ii)  There  is  an  elementary  vector  Y  of  f  containing 
the  dark  red  edge  but  no  blue  edge,  in  which  all  red  edges 
are  similarly  oriented. 

Proof.  Clearly  both  alternatives  can't  hold,  since 
It  and  it*  are  orthogonal. 

Delete  all  white  columns  of  A.  Pivot  on  blue  columns, 
one  after  another  in  any  order,  until  no  more  such  pivots  are 
possible.  (These  operations  correspond  to  the  "deletions"  and 
"contractions"  of  edges  in  graph  or  matroid  theory.)  Now 
delete  all  rows  and  columns  of  the  resulting  matrix  that 
contain  pivotal  elements.  Call  the  remaining  matrix  A. 

Note  that  any  blue  columns  of  A  consist  entirely  of  0's. 

(A  generates  a  matroid  minor  of  the  matroid  generated  by  A.) 
Let  %  denote  the  row  space  of  A.  It  follows  from  standard 
theorems  on  linear  inequalities  that  (just)  one  of  a  pair 
of  complementary  orthogonal  subspaces  contains  a  nonnegative 
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vector  whose  first  coordinate,  say,  is  positive.  Thus 
either  if  or  %*  (but  not  both)  contains  a  nonnegative 
vector  whose  dark  red  coordinate  is  positive.  Suppose 
Z  cil*  is  such  a  vector.  Then  Z  can  be  extended  to  a 
vector  Z  e  Jt*  such  that  white  coordinates  of  Z  are  all 
zero.  In  this  case  (i)  holds,  by  Lemma  1.1.  Suppose 
that  Z  e  H  is  a  nonnegatxve  vector  whose  dark  red 
coordinate  is  positive.  In  this  case  Z  can  be  extended 
to  a  vector  Z  e  <  such  that  all  blue  coordinates  of  Z 
are  zero.  In  this  case  (ii)  holds,  by  Lemma  1.1. 

If  A  is  totally  unimodular,  the  elementary  vectors 
X  and  Y  of  Theorem  3.3  can  be  taken  to  be  primitive,  and 
Theorem  3.3  reduces  to  the  painting  theorem  for  digraphoids 
[24]. 

We  return  now  to  the  version  of  Theorem  3.1  with  capacity 
constraints  b.  <  <  c ^ .  How  general  is  the  class  of  linear 

programs  encompassed  by  this  theorem?  The  answer  is  not 
hard  to  see:  it  includes  all  linear  programs.  For,  as  is 
well  known,  any  linear  program  can  be  put  in  the  form 


(3.10) 


n 


S  aijxj  “  bi'  i  “  1'  2,  •  •  • ,  m, 

j-1 

Xj  >  0,  j  ■  1,  2,  ...,  n, 


n 


maximize  cjxj  1 
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Introducing  new  variables  Xq  and  xn+^,  we  see  that  (3.10) 
is  equivalent  to 


<3.il)  £  aijxj  -  Vn+l  "  °' 

j-1 

n 

xo  -  £  cj*j  ■  °' 
j-i 


”  ®  <  x0  ^ 


0<Xj  <  ®,  j  *1,  2,  n. 


1  <  xn+l  < 


maximize  x. 


The  program  (3.11)  is  a  maximum  flow  problem  on  a  subspace 

f  —  n+2 

of  n 

Still  following  the  discussion  of  Part  I,  Section  1, 
let  us  look  now  at  the  general  version  of  the  path— edge 
formulation  of  the  maximum  flow  problem.  Is  there  an 
analogue  of  (1.9),  (1.10),  and  (1.11)  for  an  arbitrary 
real  matrix?  We  shall  see  that  there  is.  Consider  the  matrix 
whose  rows  consist  of  all  elementary  vectors  of  it*  of  the 
form  (1,  p2,  •••,  pn)-  Let  (pkj)>  k  -  1,  2,  ...,  s,  j  ■  1, 

2,  ...,  n,  denote  this  matrix  and  let  (lpkjl)  be  the  matrix 
obtained  by  taking  absolute  values  of  elements.  We  want 


-Al¬ 


to  show  that  the  programs 
s 

(3-12)  £  yk’  IPfcj  1  <  j  -  1,  2,  . .  n, 

k-1  J 

yk  ^  °' 

s  s 

maximize  J3  yk'|pkll  "  S  yk' 
k-1  k-1 


(3.13)  ^2  aijxj  “  1  *  1»  2,  . . . ,  m, 

j-1 

-c.  <Xj  <Cj>  j  -  1,  2,  ....  n. 


maximize  x^, 


are  equivalent.  Here  we  take  -  ®.  Civen  a  feasible 

solution  Y  -  (y^,  y2>  •  ••,  ys)  of  (3.12),  define 
s 

Xj  -  53  ykpkj’  Then  ~  xj  -  cj  and 

k-1  e  n 

/ 

E  auxj  -  E  E  «ijPkj  j-k  ■  °- 

1-1  k-1  '  1-1  ' 


Conversely,  given  a  feasible  solution  X  -  (x^,  x 2>  •  ••,  *n) 
of  (3.13),  we  use  the  conformal  frame  decomposition  (1.1)  to 


write 


-42- 


(3.X4)  x  -  yiPx  +  ...  +  y,P^  +  Fx  +  ...  +  Fh,  yk  >  0, 

where  P^,  . ..,  P^  are  the  first  t  rows,  say,  of  the  matrix 
(pkj),  and  each  elementary  vector  in  (3.14)  conforms  to 
X.  Define  yk  ■  0  for  the  remaining  rows  of  (Pkj)*  It 
follows  that 

s 

£  vifVj1  ^  cj- 

k-1 

Thus  (3-12)  and  (3.13)  are  equivalent  programs. 

In  particular,  if  A  is  totally  unimodular,  then 
(|pkjl)  is  a  (0*  l)-incidence  matrix,  and  an  integral  X  in 
(3.14)  yields  an  integral  Y  solving  (3.12).  Thus  integral 
capacities  lead  to  integral  solutions  in  both  programs. 

This  observation  establishes  an  analogue  of  Theorem  1.5,  Part 
I.  That  is,  an  analogue  of  the  edge  form  of  Menger's  theorem 
is  valid  for  regular  raatroids.  This  has  previously  been 
shown  by  Minty  in  (24J. 

It  seems  likely  that  the  relationship  between  (3.12) 
and  (3.13)  has  Implications  for  what  is  called  the 
"decomposition  principle"  in  linear  programming.  We  shall 
not  pursue  this  point  here. 

The  only  other  problem  from  Part  I  that  we  want  to 
examine  in  the  context  of  Part  II  is  the  length-width 
inequality.  (The  generalized  minimum  path  problem  is  the 
frame— dual  of  the  generalized  maximum  flow  problem  and  thus 
presents  nothing  new.  Part  III  will  be  devoted  to  a 
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very  general  combinatorial  analogue  of  the  maximum  capacity 
path  problem.)  Let  A  ■  (a^)  be  a  real  »  by  i-  matrix  with 
row  space  H,  and  suppose  i j ,  w^  are  given  nonnegative 
numbers  for  j  -  2,  n.  Consider  the  collection 

*  fP^,  Pr]  of  all  elementary  vectors  of  it* 

that  have  first  coordinate  1,  and  the  collection 
IS  “  {K^,  Kg}  of  all  elementary  vectors  of 

that  have  first  coordinate  1.  Let 

n 

(3.15)  X  -  min  £  IPjj'-j  l  » 

l^i<r  j .2 

n 

(3.16)  uu  -  min  V  |k,  .w.  | , 

l<k<s  £  nj  J 


where 


Pi  “  (1’  Pi2’  pin^’  1  "  l*  2’  *•**  r' 

-  (1,  kh2,  ...,  khn),  h  -  1,  2,  s. 

We  call  X  the  e^— length  of  A,  and  call  uu  the  e^-width  of  A. 

Theorem  3.4.  Let  A  •  (a^)  be  an  m  by  n  real  matrix 
having  e ^-length  X  relative  to  £  0,  j  •  2,  . . . ,  n,  and 
e^-width  uu  relative  to  w^  £  0 ,  j  -  2 ,  . .  .  ,  n .  Then 

n 

^  <  £  *jwj* 

J-2 


(3.17) 
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Proof.  By  Theorem  3-1,  Part  II  (the  generalized 
max— flow  min— cut  theorem)  and  the  equivalence  of  (3.12) 
and  (3.13),  it  follows  from  (3.16)  that  there  exists  a 
nonnegative  r— vector  Y  *  (y^,  . ..,  yr)  such  that 

r 

(3.17)  £  IXiPij I  <wj’  J  m  2>  •••* 

i-l 

i  -  - 

i-l 

Thus  we  have 

r  r  n  n 

x E  *1  *  E  E  *  E  Vj 

i-l  1-1  j-2  j-2 

by  (3.18),  (3.15),  and  (3.17),  respectively. 

Again  if  A  is  totally  unimodular,  then  each  P  e  1* 
and  K  €  It  is  primitive;  taking  -  w^  -  1  gives  a  direct 
generalization  of  the  Moore— Shannon  theorem  for  graphs 
to  totally  unimodular  matrices.  In  matroidal  terms: 

Corollary  3-5.  Let  (e,  G)  ba  a  regular  matroid  fin 
n  edges •  Let  X (e)  +  1  be  the  least  number  of  edges  in  any 
circuit  containing  edge  e,  iu(e)  +  1  the  least  number  of 
edges  in  any  cocircuit  (circuit  of  the  dual  matroid) 
containing  e.  Then  X(e)uj(e)  <  n  —  1. 
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PART  III  -  BLOCKING  SYSTEMS 

In  Part  I  we  described  the  maximum  capacity  path 
problem  for  a  two-terminal  network,  gave  a  good  algorithm 
for  solving  it,  and  presented  a  min-max  theorem  concerning 
paths  and  cuts  for  the  problem-  Similarly,  Gross  [17]  has 
described  a  good  algorithm  and  a  min-max  theorem  for  the 
"bottleneck  assignment  problem":  Given  a  square  array  of 
real  numbers,  find  a  circling  of  entries  with  exactly  one 
circle  in  each  row  and  in  each  column  so  as  to  maximize 
the  value  of  the  smallest  circled  entry-  For  an  interpre¬ 
tation,  think  of  rows  of  the  array  as  corresponding  to 
men,  columns  to  jobs  on  a  serial  assembly  line,  with  the 
entry  in  row  i  and  column  j  being  the  rate  at  which  man 
i  can  process  items  if  he  is  assigned  to  job  j.  The 
theorem  established  in  [17]  for  this  problem  is  the  fol¬ 
lowing:  Let  I  «  [1,  2,  ...,  n) ,  let  'P  be  the  set  of 
permutations  of  I,  let  |c|  denote  cardinality  of  C,  and 
let  a^,  i  e  I,  j  e  I,  be  real  numbers-  Then 


max  min  a 
PeP  iel 


i,P(i) 


min  max  a 

A.BiI  ieA 

|  A |  +]  B |  *n+l  jeB 


ij 


The  resemblance  between  these  two  min-max  theorems  is 
more  than  superficial.  They  are,  in  fact,  special  cases 
of  a  general  theorem  for  a  combinatorial  structure  which 
might  be  called  a  blocking  system.  These  systems  have  arisen 
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in  numerous  contexts  (see  [21,  22,  29],  for  example),  but 
the  particular  axiomatizatlon  and  general  min-max  theorem 
presented  in  [7]  and  surveyed  here,  have  apparently  not 
been  noted  before. 

1.  AXIOMS  AND  EXAMPLES 

Let  E  be  a  finite  set,  and  let  "P  and  ft  be  two  families 
of  subsets  of  E.  We  call  (E,  P,K)  a  blocking  system 
(on  E)  if  the  following  two  axioms  are  satisfied: 

(1.1)  For  any  partition  of  E  into  two  sets  E^  and  Eq 
(Eq  n  E^  ■  t  and  Eq  u  E^  -  E),  there  is  either 

a  member  of  P  contained  in  E^  or  a  member  of 
V  contained  in  Eq,  but  not  both. 

(1.2)  No  member  of  P  contains  another  member  of  P\  no 
member  of  V  contains  another  member  of  ft. 

The  first  axiom  (11)  can  be  phrased  in  terms  of 
painting  elements  of  E  with  two  colors;  For  any  blue- 
red  painting  of  E,  there  is  either  a  blue  P  in  P  or  a  red 
K  in  ft,  but  not  both.  The  second  axiom  (1.2)  is  more  a 
convenience  than  a  necessity  for  our  purposes,  as  will  be 
clearer  later  on. 

Observe  that  if  (E,  P,  ft)  is  a  blocking  system,  then 
for  each  P  c  P  and  K  t  ft,  we  have  PH  K  ^  t,  by  virtue  of 
the  last  phrase  in  (1.1).  In  other  words,  each  member  of 
X  blocks  all  members  of  P,  and  vice-versa.  Note  also  that 
the  axioms  (1.1)  and  (1.2)  are  self-dual:  Interchanging 
the  roles  of  P  and  ft  alters  neither  1.1)  nor  (12). 
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If  V  is  empty,  then  S'  ■  {(5}  satisfies  (1.1)  and  (1.2). 

Examp1 es  of  blocking  systems  abound.  Some  reasonably 
interesting  ones  will  be  described.  But  first  we  state  and 
prove  a  theorem  ’"hat  indicates  the  great  profusion  of 
blocking  systems.  Its  proof  provides  another  characteriza¬ 
tion  of  blocking  systems. 

Following  (7],  we  shall  call  a  family  /  of  subsets  of 
E  a  clutter  on  E  if  no  member  of  f  contains  another  member 
of  J*. 

Theorem  1 ■ 1 ■  Let  E  be  a  finite  set  and  let  ^  be  a 
clutter  on  E.  Then  there  exists  a  unique  clutter  yen  E 
such  that  (E,  iP,  ^)  is  a  blocking  system. 

Proof.  Let  K  €  X  if  and  only  if  K  n  P  f  0  for  all 
P  a  -f  and  K  is  minimal  with  respect  to  this  property.  To 
verify  that  (E,  '?>  X)  is  a  blocking  system,  it  suffices  to 
check  (1.1).  Thus  consider  a  blue-red  painting  of  E. 

Suppose  there  is  no  blue  P  c  P.  Let  R  be  the  set  of  all 
red  members  of  E  that  belong  to  some  P  c  1* .  Since  there 
is  no  blue  P  e  P,  we  have  R  f  P  M  for  every  Pc  'P. 

Hence  there  is  a  K  c  X  such  that  K  c  R,  i.e.,  there  is  a 
red  K  c  X-  If  there  were  both  a  blue  P  c  P  and  a  red  K  (  X, 
then  P  h  K  »  t,  contradicting  the  definition  of  K-  Thus 
(1.1)  holds  and  (E,  P,  K)  is  a  blocking  system. 

To  establish  uniqueness,  let  (E,  P,  K)  and  (E,  P,  X! ) 
be  blocking  systems  on  E  with  gf ^  .  Interchanging  the 

roles  of  X  and  *f*  if  necessary,  we  may  suppose  K  c  X  -  x' . 


t 
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Consider  the  partition  E  -  K,  K  of  E  By  (11)  applied 
to  (E,  P,  K) ,  no  subset  of  E  -  K  is  a  member  of  P.  Hence 
by  (1.1)  applied  to  (E,  P,  *£’),  there  is  a  K'  c  Ft’  with 
K'  c  K-  Now  consider  the  partition  E  -  K',  K'  of  E.  By 
(1.2),  no  subset  of  K'  is  a  member  of  X.  Hence  by  (1.1) 
applied  to  (E,  P,  X),  there  is  a  P'  e  P  with  P*  c  E  -  K1. 

But  then  P'  and  K'  violate  (1.1)  for  the  blocking  system 
(E,  P,  )  and  the  partition  E  -  K' ,  K'  of  E.  This  con¬ 
tradiction  proves  Theorem  1.1. 

Thus  if  P  is  an  arbitrary  clutter  on  E,  the  family 
ft  »  P*  of  all  "minimal  blockers"  of  P  is  the  unique 
family  of  Theorem  1.1,  and  X*  •  P**  -  P. 

The  primary  role  of  (12)  is  to  obtain  uniqueness  in 
Theorem  11.  Uniqueness  could  be  achieved  in  other  vays. 

For  instance,  instead  of  normalizing  to  clutters  P  and  X 

in  Theorem  1.1,  we  could  normalize  to  the  families 

and  of  all  supersets  of  members  of  P  and  'X,  respectively. 

Some  examples  of  blocking  systems  follow. 

Example  1 •  Let  E  be  the  set  of  edges  of  a  graph  G, 

P  the  family  of  elementary  paths  joining  two  vertices  of 
G,  end  Ft  the  family  of  elementary  cuts  separating  the  two 
vertices . 

fefiSEilJfc-  Let  E  be  the  set  of  cells  in  an  n  by  n 
array;  let  P  be  the  family  of  subsets  P  c  E  having  the 
property  that  there  is  just  one  cell  of  P  in  each  row  and 
column  of  the  array;  let  yC  be  the  family  of  subsets  K  c  E 
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such  that  K  is  a  p  by  q  subarray  withp+q*n+l. 

(That  (E,  p,  X)  is  a  blocking  system  follows  from  a  well- 
known  theorem  of  Konig  [20]  which  asserts  that  in  an  n  by 
n  (0,  l)-matrix,  the  maximum  number  of  l's,  no  two  of 
which  lie  in  the  same  row  or  column,  is  equal  to  the 
minimum  number  of  rows  and  columns  that  contain  all  the 
l's  of  the  array.)  More  generally,  let  P  be  the  family  of 
subsets  P  c  E  such  that  |p|  »  t  and  P  has  at  most  one  cell 
in  each  row  and  column.  Then  X  is  the  family  of  subsets 
K  c  E  such  that  K  is  a  p  by  q  subarray  with  p  +  q  «  2n  —  t  + 

Example  3.  Let  E  ■  { 1,  2,  ...,  2k-l) ,  let  P  be  the 

family  of  all  k-element  subsets  of  E,  and  let  X  *  P ■  (In 
multi-person  game  theory,  this  example  is  known  as  the 
"straight  majority  game.") 

Example  4.  Let  E  be  the  set  of  edges  of  a  graph  G, 

let  P  be  the  family  of  maximal  trees  of  G,  and  let  X  be 

the  set  of  all  elementary  cuts  (cocircuits)  of  G.  (A 
tree  of  G  is  a  subgraph  of  G  that  contains  no  circuit;  a 
”>aximal  tree  is  a  tree  of  G  that  is  maximal  with  respect 
o  this  property.) 

Example  5.  Let  E  be  the  set  of  edges  of  a  graph  G, 
let  P  be  the  family  of  circuits  in  G,  and  let  K  be  the  set 
of  cotrees  (complement s  in  E  of  trees)  of  G. 

Example  6-  Let  E'  be  the  set  of  edges  of  a  matroid 
(E',  <?),  let  E  «  C'  -  {e}  for  some  e  c  E* ,  and  let  P  be 
the  family  of  subsets  P  of  E  such  that  [e]  U  Pc  C  • 
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Then  K  is  the  family  of  subsets  K  of  R  such  that  {e}  u  K  t  tf*. 

Here  (E1,  £*)  is  the  matroid  dual  to  (E',<S). 

Example  7.  Let  E  be  the  set  of  vertices  of  a  graph  G, 

and  let  'P  be  the  family  of  pairs  of  adjacent  vertices  of  G 
(two  vertices  are  adjacent  if  they  are  joined  by  an  edge.) 

Then  H  is  the  family  of  subsets  of  vertices  K  such  that 
K  covers  all  edges  of  G,  and  is  minimal  with  respect  to 
this  property.  (In  other  words,  X  is  the  family  of  all 
"minimal  blockers"  of  f* . ) 

It  is  frequently  difficult,  as  illustrated  by  Example 
7,  to  find  a  useful  description  of  the  dual  clutter  P of 
a  simply  described  clutter  P. 

One  of  the  most  important  problems  concerning  blocking 
systems,  a  problem  that  arises  time  and  again  in  applications, 
is  the  minimum  covering  or  blocking  problem:  Given  a  simple 
description  of  P,  find  a  good  algorithm  that  constructs 
K  €  Tfc'  such  that  J  K j  is  a  minimum.  For  example,  we  might 
be  given  P  explicitly,  say  in  the  form  of  an  incidence 
matrix  A  «  (a(P,  e)),  where  a(P,  e)  «  1  or  0  according  as 
e  e  P  or  e  |  P.  The  minimum  blocking  problem  then  is 
equivalent  to  solving  the  following  linear  program  in 
integers  x(e)  *  0  or  1: 

(1.3)  S  a(P,  e)x(e)  >  1,  all  P  e  p, 

eeE 

minimize  x(e). 
eeE 
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Various  methods  have  been  proposed  for  such  problems, 
but  no  good  algorithms  are  known.  Indeed,  most  of  the 
methods  that  have  been  propped  can  be  shown  to  be  bad: 
the  amount  of  computational  effort  increases  exponentially 
with  the  size  of  the  problem. 

There  is  a  good  algorithm,  however,  for  computing 
the  following  lower  bound  on  the  minimum  in  (1.3).  Consider 
the  class  CL  of  all  (0,  l)-matrices  having  the  same  row 
and  column  sums  as  A.  For  A  in  CL>  let  n(A)  denote  the 
minimum  in  (1.3),  and  let 

(1.4)  w  ~  min  jj(A)  . 

Ae  CL 

The  integer  uj  has  been  explicitly  evaluated  by  Fulkerson 
and  Ryser  in  (14),  and  a  very  simple  construction  for  a 
matrix  Ik  in  <2-  such  that  uu(X)  *  w  has  been  given  in  [15]. 

2.  THE  MIN— MAX  THEOREM 

The  analogue  of  Theorem  3.1,  Part  I,  is  valid  for  all 
blocking  systems,  and  can  be  viewed  as  characterizing 
blocking  systems: 

Theorem  2.1.  Let  (E,  P,  fC)  be  a  blocking  system,  and 
let  f  be  a  real-valued  function  defined  on  E.  Then 

(2.1)  max  min  f(e)  «  min  max  f(e). 

PtfeeP  Ketf  eeK 


Conversely,  if  7® .and  H  ar£  gjy, £H  E  such  that  (2.1) 
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holds  for  every  real-valued  f  defined  on  E,  then  (E, V ,  #) 
is  a  blocking  system. 

Proof.  The  proof  that  (2.1)  holds  for  a  blocking 
system  is  entirely  analogous  to  the  proof  of  Theorem 
3.1,  Part  I.  In  brief:  The  left-hand  side  of  (2.1)  is 
less  than  or  equal  to  the  right-hand  side  since  P  fl  K  is 
nonempty  for  each  P  e  P,  K  e  ¥.  To  establish  equality, 
order  the  elements  of  E  according  to  decreasing  values  of  f; 
then  paint  elements  of  E  blue,  one  after  another,  until 
the  blue  set  first  contains  an  element  of  . 

(In  other  words,  the  threshold  method  establishes 
equality  in  (2.1)  and  simultaneously  evaluates  (2.1). 

It  will  be  a  good  method  for  this  evaluation  in  case  there 
is  a  good  method  for  recognizing  whether  an  arbitrary  sub¬ 
set  of  E  contains  a  member  of  P  (or  a  member  of  t£) .) 

Conversely,  let  P  and  ft  be  clutters  on  E  and  suppose 
(2.1)  holds  for  every  real-valued  f  defined  on  E.  Let 
f(e)  ■  1  or  0  according  as  e  is  blue  or  red.  Suppose  there 
is  no  blue  P  e  P.  Then 

max  min  f(e)  »  0  «  min  max  f(e). 

P eP  eeP  Kelt  eeK 

If  there  were  no  red  K  e  It,  we  would  have 

min  max  f(e)  ■  1, 

Ketf  eeK 
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a  contradiction.  Hence  there  is  a  red  K  e  .  On  the 
other  hand,  if  there  were  both  a  blue  P  €  V  and  a  red 
K  e  X,  then 

max  min  f(e)  *  1,  min  max  f(e)  *  0, 

PejP  eeP  KeJt  eeK 

contradicting  (2.1).  Hence  (E,  P,1C)  is  a  blocking 
system. 

3.  THE  LENGTH-WIDTH  INEQUALITY  AND  MAX-FLOW  MIN-CUT 
EQUALITY 

Let  (E,  p,  K)  be  a  blocking  system,  and  suppose 
t(e),  w(e)  are  two  nonnegative  numbers  associated  with 
element  e  e  E.  Define  the  length  of  the  system  to  be 

(3.1)  \  -  min  Z I  t(e) , 

P eP  eeP 

and  the  width  to  be 

(3-2)  uu  •  mi n  E  w(e) . 

KeX  eeK 

Following  Lehman  [22 J,  we  shall  say  that  the  length-width 
inequality  holds  for  (E,  P,  fC)  if 

(3.3)  Xuu  <  £  t(e)w(e) 

eeE 

is  satisfied  for  every  pair  of  nonnegative  functions  l,  w 
defined  on  E. 
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For  instance,  if  (E,  *>,  X)  is  the  blocking  system 
of  Example  1,  we  have  seen  in  Part  I  that  the  length- 
width  inequality  holds.  It  also  holds  for  Example  6 
provided  the  underlying  matroid  is  regular;  this  is  a 
corollary  of  Theorem  3.4,  Part  II.  On  the  other  hand, 
the  length-width  inequality  fails  for  the  blocking 
system  of  Example  3. 

For  each  P  e  f3  and  e  €  E,  define  a(P,  e)  ■  1  or  0 
according  aseePore^P.  Now  consider  the  linear 
program 

(3-3)  £  y(P)a(P,e)  <  w(e),  e  e  E, 

y(P)  >0,  p  e  V, 

maximize  £  y(P) . 

P  eP 

Clearly  the  maximum  in  (3.3)  is  less  than  or  equal  to 
the  width  of  (E,  ,  K) .  If  equality  holds  here  for 
every  nonnegative  w  defined  on  E,  we  say,  as  in  [22], 
that  the  max— flow  min— cut  equality  holds  for  (E,  1°,  X) . 

Thus,  for  instance,  the  max-flow  min-cut  equality 
holds  for  Example  1,  for  Example  6  if  the  underlying 
matroid  is  regular,  and  fails  for  Example  3,  just  as  for 
the  length-width  inequality. .  This  behavior  is  not 
accidental.  One  of  the  main  results  of  [22]  is  that  the 
max-flow  min-cut  equality  holds  for  a  blocking  system 
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if  and  only  if  the  length-^width  inequality  holds. 
Consequently,  if  the  max-flow  min-cut  equality  holds 
for  (E,  P,  H) ,  it  also  holds  for  (E,  fC,  P) ,  since  the 
roles  of  P  and  X  are  symmetric  in  the  length-width 
inequality. 

In  any  event,  the  problem  of  evaluating  the  width  of 
a  blocking  system  for  a  given  nonnegative  function  w  is  a 
generalization  of  the  minimum  blocking  problem  mentioned 
earlier.  It  would  be  interesting  to  discover  other 
significant  classes  of  blocking  systems  for  which  the 
length-width  inequality,  and  hence  the  max-flow  min— cut 
equality,  holds. 
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A  (directed)  graph  G,  for  purposes  here,  Is  a  finite  set  of  nodes  and 
a  finite  set  of  edges .  where  each  edge  la  said  r.o  be  directed  toward  one  of 
the  node*,  called  the  front  end  of  the  edge,  and  said  to  be  directed  away 
ftp*  *  different  one  of  the  nodes,  celled  the  rear  end  of  the  edge.  An  edge 
end  each  of  its  ends  are  said  to  eeet.  A  subgraph  of  G  la  a  Subcollectloo 
of  its  neabers  which,  under  r  a  «ut  incidence  relations,  la  a  graph.  A 
graph  la  callad  connected  If  it  ie  not  empty  end  its  aaabera  do  not  partition 
Into  two  disjoint  aen-eatpty  subgraphs.  A  polygon  is  a  connsctad  graph  Q 
auch  that  tach  rode  of  Q  neats  exactly  two  edges  of  Q  .  An  (ale— nt err 
kflJLtglSll  directed)  circuit  ie  s  polygon  which  contains  one  edge  directed 
toward,  end  one  edge  directed  sway  fron,  each  of  it*  nodes.  A  forest  is  e 
graph  which  contains  no  polygon.  A  tree  i*  a  contacted  fpreet.  A 
la  a  foraat  whoaa  edges  are  directed  so  that  each  Is  dlrscttd  toward  a  different 
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nod*.  An  arborsscsncs  1*  *  connected  branching.  An  (elsacntatv  uniformly 
directed)  path  P  i#  an  arbortscenct  Much  that  tach  edge  in  P  la  directed 
away  from  a  different  node,  and  such  that  thara  la  at  laast  oca  edge  in  P  . 

We  shall  occasionally  use  "obvious”  facta  about  graphs  without  justify¬ 
ing  thaw. 

Clearly,  a  branching  (foraat)  is  tb«  union  of  a  unique  f sally  of  dis¬ 
joint  arborascancas  (trees). 

Exactly  one  node  la  an  arberesceaca  T  ,  called  the  root  of  T  ,  has  no 
edge  of  T  directed  toward  it.  A  branching  (forest)  is  an  arbor* seance  (tree) 
if  and  only  if  it  has  exactly  one  less  edge  than  nodes.  No  branching  (forest) 
has  nor*  edges  than  this. 

In  a  path  P  there  are  exactly  two  nodes,  callad  tha  ands  of  P  ,  which 
each  mat  only  ona  edge  in  P  .  The  rest  of  the  nodee  in  P  each  met  exactly 

two  edgea  in  P  .  A  path  P  ic  said  t£  gg  fron  tha  noda  which  la  only  a  raar 

and  in  P  (tha  root  of  P)  tg  the  node  which  is  ouly  s  front  end  in  P  .  For 

any  erboreecence  T  ,  and  any  node  v  in  T  except  the  root ,  there  la  e 

unique  path  in  T  going  fron  the  root  to  v  •  Any  path  in  T  going  to  V 
end  any  path  in  T  going  fron  v  have  only  \f  in  conmon,  end  their  union  ie 
a  path.  And  so  on. 
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Let  G  be  any  graph  with  a  real  numerical  weight  corresponding 
to  each  edge  e^eG  .  The  problem  treated  here  is  to  find  in  G  a  branch¬ 
ing  B  which  has  aaxlmum  total  weight,  T.  ,  sunned  over  e^<  B  .  B  la 
called  an  optimum  branching  in  G  . 

First  we  show  that  certain  variations  of  the  problem  reduce  immediately 
to  it. 

A  spanning  subgraph  of  G  is  a  subgraph  which  contains  all  the  nodes 
of  G  .  A  branching  in  G  is  a  spanning  ar bore seance  of  G  if  and  only  if 
the  number  of  its  edges  ia  one  less  than  the  number  of  nodes  in  G  .  Ho 
branching  in  G  can  have  more  edges  than  this. 

An  opt lmm  branching  in  G  of  course  containa  no  edge  with  negative 
weight,  and  Indeed  may  be  empty  if  all  c^  <  0  .  Even  if  all  >  0  and 
G  contains  a  spanning  arborescence,  an  optimum  branching  in  G  need  not  be 
an  arborescence. 

If  there  is  a  spanning  arborescence  T  in  G  ,  then  an  optimum  one,  i.e. 
one  which  haa  maximum  total  weight,  E  ,  yT  ,  can  be  found  as  an  optimum 

branching  in  G  where  the  edges  carry  new  weights  c^ 1  *  +  h,  h  >  E  |c^|  , 

SjCG  .  A  spanning  arborescence  in  G  which  is  optimum  relative  to  weights 
Cj,  e^eG  ,  is  elso  optimum  relative  to  weights  c^  +  k,  e^cG  ,  for  any 
constant  k  ,  since  every  spanning  arborescence  has  the  seme  number  of  edges. 


Constant  h  is  larger  than  the  difference  in  total  weights  (relative  to 
weights  0^,  SjC  6)  of  any  two  branchings  in  G  .  It  follows  that  an  optiaua 
branching  in  G,  relative  to  weights  c^'  ■  c^  +  h  ,  will  be  a  branching  with 
a  maximum  number  of  edges.  In  particular,  it  will  be  a  spanning  rroorescence 
if  and  only  if  G  contains  a  spanning  arborescence . 

A  spanning  arborescence  T  in  G  which  has  minimum  total  weight, 
Ec^e^I,  is  the  sane  as  one  which  has  maximum  total  weight  E  c^',  e^eT  , 
relative  to  weights  c^ '  -  -c^  . 

It  will  be  evident  that  the  efficiency  of  the  method  for  treating  optimum 
branchings  is  not  seriously  effected  by  a  large  change  h  (say  of  the  fora  10n) 
in  all  the  weights.  In  fact  the  nethod  is  easily  modified  to  treat  optima 
spanning  arborescences  directly. 

If  there  is  a  spanning  arborescence  in  G  which  is  rooted  at  a  prescribed 
node,  say  r  ,  then  an  optiaua  one  can  be  found  by  finding  an  optimum  spanning 
arborescence  in  the  graph  G'  obtained  from  G  by  adjoining  a  new  edge  e^ 

(Wiry teg  arbitrary  weight  Cq)  which  is  directed  toward  r  and  directed  from 
a  new  node  having  no  other  incident  edges.  Clearly,  T  is  a  spanning 
arborescence  in  G  which  is  rooted  at  r  if  and  only  if  T  together  with 
eQ  is  a  spanning  arborescence  of  G' 

If  the  edges  in  graph  G  represent  the  links  for  possible  direct  comunl- 
catlon  from  one  node  to  another,  ii  each  c^  is  the  cost  of  direct  ro— in  1  r  n 
tlon  froa  the  rear  end  of  Oj  to  the  front  end  of  s^ ,  and  if  cost  is  additive, 


then  &  minimum-total -weight  spanning  arborescance  rooted  at  prescribed  node 
r  represents  the  least  costly  way  to  have  a  sassage  coanunicated  from  r 
to  all  other  nodes  of  G  . 

Another  application  is  where  it  is  desired  to  arrange  an  institution 
into  an  optimum  heirarchy  (branchocracy) . 

$3 


Our  main  result  is 

Theorem  1 .  There  exists  a  good  algorithm  for  finding,  Jjj  any  graph  G 
with  a  numerical  weight  corresponding  to  each  edge,  an  branching . 

We  say  an  algorithm  is  good  if  there  is  a  polynomial  function  f(n)  which, 
for  every  positive -Integer  valued  n,  is  an  upper  bound  on  the  "amount  of  work" 
the  algorithm  does  for  any  input  of  "alee"  n  .  The  concept  is  easy  to 
formalize  —  —  relative,  say,  to  a  Turing  machine,  or  relative  to  any  typical 
digital  computer  with  an  unlimited  supply  of  tape. 

For  optimum  branching,  the  largest  number  of  significant  digits  in  an 
edge  weight,  as  well  as  the  number  of  edges  of  G  ,  must  be  figured  somehow 
into  the  measure  n  of  input  "size"  .  One  might  for  example  take  n  to  be 
the  maximum  of  these  two  numbers  or  to  be  the  vector  consisting  of  both 
numbers. 

theorem 

The  proof  of  Theorem  1  is  constructive.  The/ls  proved  by  displaying  one 
particular  algorithm  for  optimum  branching  which  is  obviously  good. 


If  m  ruuv*  fro*  the  optimum-spanning -arboraacancs  problem  the  con¬ 


dition  that  aach  member  of  tha  aat  T  of  a dgaa  being  optimized  must  have 
a  different  front  and,  than  we  get  the  optlmum-spanniag-trea  problem.  That 
la  to  find.  If  there  la  one.  In  any  graph  6  with  a  meter  leal  weight  on 
each  edge,  a  spanning  tree  which  has  maximum  (or  minimum)  total  weight. 

Especially  simple  algorithms  are  well-known  for  this  problem  [cf.  5 
and  6].  One  is,  starting  with  an  empty  bucket,  build  up  a  set  of  elements 
having  "admissible  structure"  by  putting  elements  into  the  bucket  one  after 
another  as  long  as  possible,  so  that  each  addition  is  a  maximum  weight  ele¬ 
ment  among  those  not  In  the  bucket  which,  together  with  the  ones  already  in 
the  bucket,  would  preserve  admissible  structure.  For  the  optlmum-spannlng- 
tree  problem,  tha  elements  are  the  edges  of  G  and  "admissible"  means 
"forest".  The  algorithm  is  certainly  good.  It  Is  also  valid  for  that  pro¬ 
blem. 

Where  admissible"  means  "branching",  tha  above  algorithm  is  not  gener¬ 
ally  valid  for  finding  an  optimum  spanning  arborescenca .  Paper  [3]  abstractly 
characterises  those  structures  for  which  this  "greedy  algorithm"  is  valid  for 
am*  nine  rlcal  weighting. 

If  we  add  to  the  conditions  of  the  opt lmua-spsxralng-ar bora scenes  problem 
the  condition  that  each  member  of  the  set  of  edges  being  optimised  is  to  have 
a  different  rear  end,  then  we  have  the  problem  of  finding,  if  there  is  one, 
an  optimum  spanning  (uniformly  directed)  path  in  any  graph  G  with  a  numerical 
weight  on  each  edge.  This  is  a  version  of  the  well-known  traveling  saleman 
problem  [cf.  4].  I  conjecture  that  there  is  no  good  algorithm  for  the  traveling 
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■aleman  problem.  Ky  reasons  arc  the  ease  ac  for  any  mathematical  eonjactura: 
(1)  It  la  a  legitimate  mathematical  possibility,  and  (2)  I  do  not  knov. 

A  matching  in  a  graph  la  a  aubact  of  Ita  edgea  auch  that  no  two  mi  them 
meat  the  aame  node.  A  good  algorithm  la  known  for  finding,  la  any  graph 
with  a  numerical  weight  on  each  edge,  a  maximum -total -weight  matching.  The 
treatment  [1  and  2]  of  maximum  matching*  and  the  treatment  here  of  optimum 
branchings  are  similar,  though  the  structural  details  are  different  and  maxi¬ 
mum  matching  ia  more  cosspllcated. 

«4 

Here  ia  the  algorithm  for  finding  a  maximum-total  weight  branching 
in  any  (directed)  graph  G  with  a  numerical  weight  on  each  edge 
e^«G .  Recall  that  a  branching  la  a  forest  auch  that  eoch  edge  la  directed 
toward  a  different  node. 

Begin  the  algorithm  by  applying  Instruction  (11)  where  0*  la  G®  ■  G 
and  where  D*  and  E*  are  empty  buckets,  D®  and  I® 

(11 )  Choose  a  node  y  In  G*  and  not  la  3*  Put  y  Into  bucket 
D1  .  If  there  la  In  G1-  a  positively  weighted  edge  directed  toward  y  ,  put 
one  of  then  having  maximum  weight  Into  bucket  I*  . 
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Repeat  (II)  until 

(a)  E*  no  longer  comprises  the  adgae  of  e  branching  in  G*,  or  until 

(b)  every  node  of  G*  is  in  D*  ,  end  R*  does  comprise  the  edges  of  e 
branching.  When  case  (a)  occurs,  apply  (  I  2). 

For  convenience  assume  that  every  branching  which  we  consider  in  graph 

G*  contains  all  the  nodes  of  G*  .  We  say  that  a  set  of  edges  in  G*  forma 

the  unique  subgraph  of  G*  consisting  of  those  edges  and  ell  nodes  in  G*  . 

Each  edge  e  put  into  E1  according  to  ( I  1  )  is  directed  toward  a 

node  p  which  is  the  root  of  a  connected  component  of  the  branching,  say  B, 

formed  by  the  edges  in  E1  before  e  is  put  into  E1  .  If  the  rear  end  p. 

o 

of  e  is  in  a  different  component  of  B  than  p  ,  then  BUe  is  a  branching, 
and  so  wh-jn  e  is  put  into  E*  ,  (a)  does  not  hold. 

If  is  in  the  seas  component  of  B  as  p  ,  then  B  contains  a  unique 

path  F  going  from  p  to  p^  .  In  this  case,  Q1  *  File  is  a  circuit  con¬ 
tained  in  Bile  ,  so  as  soon  as  e  is  put  into  E*,  (a)  does  hold. 

(I  2)  Store  Q*  and  a  specification  of  one  of  the  edges,  say  ej  ,  of 
Q1  which  has  minimum  weight  in  Q1  relative  to  the  edge-weights  for  G1  . 
Obtain  a  new  graph  G1+1  from  G1  by  "shrinking"  to  a  single  new  node, 

,  Che  circuit  Q*  end  every  edge  of  Gl  which  has  both  ends  in  Q1.  The 
edges  (denoted  as  e*+l)  of  Gi+1  are  those  edges  (denoted  as  a*  )  of  G* 

which  have  at  most  one  end  in  Q*  .  Every  edge  of  G1  which  has  one  end  in 

Q*  will  in  Gi+^  have  at  th»t  end.  All  other  edge-ends  are  the  same 

in  Gi+l  as  in  Gl  .  The  nodes  of  Q1  are  not  in  Gl+l  . 
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Every  edge,  aay  e*+1  ,  which  as  e*  in  G1  la  directed  toward  a  node, 
aey  |/*  ,  in  Q*  end  directed  away  froa  a  node  not  in  ,  gate  a  poesibly 
different  weight  for  G*+1 


(1) 


i+1 

C3 


i  .  i 
C3*C0 


c 


i 

4 


where  c*  ia  the  weight  of  e*  for  8*  j  where  Cq  ia  the  niainuai  weight 

i  i  1  i 

for  G  of  an  edge,  aey  e^  ,  in  Q  ;  and  where  c^  ia  the  weight  for 

1  i  1  * 

G  of  the  unique  edge,  aay  e^  ,  which  la  in  Q  and  directed  toward  y*  . 

All  other  edgea  in  C1+1  keep  the  ease  weight  aa  for  . 

In  Juetlfying  the  algorithm  we  ahall  make  uae  of  the  following  relatlona 

(2)  cj  >  0  ,  (3)  cj  >  cj  ,  end  (4)  cj  £  c* 


Put  into  bucket  Di+l  the  node a  which  are  in  both  and  bucket  D*  . 

Put  into  bucket  Ei+i  the  edgea  which  are  in  both  G*+*  and  bucket  I*  ,  i.e., 
put  into  bucket  Ei+*  the  final  contenta  of  bucket  B*  ninua  the  edgea  of 

1  4X1 

circuit  Q  It  ia  eaey  to  aee  that  the  edgea  in  bucket  K  fora  a  branch¬ 
ing  in  Gl+l  . 

Continue  the  a  Igor  it  ha  by 

applying  (II)  where  1  la  one  greeter. 

Eventually,  after  e  aaall  number  of  epplicatlona  of  (II)  and  (I  2)  , 


caae  (b)  auat  occur. 
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As  soon  as  (b)  occurs,  for  say  i  -  k  ,  (I  l)  and  (I  2)  srs  never 

applied  again.  Instead,  (I  3)  Is  applied  successively  for  i  +  l  •  k, 

k-l,  ...  ,  1  ,  until  ths  graph  Gl  obtained  is  tha  original  G  .  At  that 

point,  tha  branching  B1  -  B°  is  a  narinun- total -weight  branching  of  G  . 

k  k 

Tha  final  contents  of  bucket  I  fora  a  branching  in  graph  G  which 
we  call  B*  . 

(13)  It  Is  not  difficult  to  see  that  since  B1+1  is  a  forest  In 
Gi+1  and  since  Gl+1  Is  obtained  fron  G1  by  shrinking  the  circuit  Q1  in 
G1  (and  all  edges  of  G1  with  both  ends  in  Q1  )  to  the  node  of  G1*1  , 

the  subgraph  H1  of  G1  ,  forasd  by  ths  edges  in  B1+1  and  the  edges  In  Q*, 
contains  only  one  polygon,  nanely  Q* 

In  the  case  where  is  not  a  root  of  (a  connected  coaponent  cf) 

branching  Bi+1  in  Gi+l  ,  there  is  a  unique  edge,  say  ej+1  ,  of  Bi+l 
which  is  directed  toward  .  In  C1  ,  ej  is  directed  toward  a  node, 

say  ,  of  Q1  .  Since  Q1  is  a  circuit,  there  is  a  unique  edge,  say  e£  , 
of  Q1  which  is  directed  toward  .  Clearly,  e*  and  e^  are  the  only 

two  edges  of  H1  which  are  directed  toward  the  sane  node.  Thus,  since  e^ 
is  in  the  only  polygon  of  ,  deleting  e^  frow  H1  yields  a  branching 
in  G*  ,  which  is  called  B*  . 

In  the  case  where  is  a  root  of  branching  B***  in  Gl+*  ,  i.e., 

where  no  edge  of  Bi+l  is  directed  toward  ,  no  two  edges  of  H1  ate 

directed  toward  the  sene  node.  Therefore,  deleting  any  edge  of  Q*  from  H* 
yields  a  branching  in  G*  To  obtain  ths  branching  Bl  in  G*  ,  delete 
frow  H*  one  of  the  edges  e^  of  Q1  which  hes  aininn  weight  c^ 


11 


That  completes  the  description  of  the  algorithm.  Evidently  It  la  a 

0 

good  algorithm.  Evidently  Ita  output  la  a  branching  B  la  graph  G  .  In 

0 

order  to  prove  Theorem  1,  what  remains  to  be  dona  la  prove  that  B  haa 
■an tm  total  weight. 


§5 


Theorem  1  and  the  following  geometric  theorem  are  proven  together. 

Let  G  be  any  graph.  (No  edge-welgtta  ase  specified.)  Let  there-  be  a 

real  variable  x.  for  each  edge  e  «G  .  Let  P_  be  the  polyhedron  of 
J  J  C 

vectors  x  ■  [x^  ]  which  aatiafy  the  ayaten  L^.  ,  collating  of  inequalities 
1*1  *  ^2  >  1*3  * 

(Lj)  For  every  edge  *j*  G  ,  *j  >  0  . 

(Lj)  For  every  node  yc  G  ,  £  x^  <1  ,  aummed  over  ell  j'e  such  that 

e^  la  directed  toward  y  . 

(Lj)  For  every  aet  S  of  two  or  ire  nodee  in  G,  £  x^  |s|  -  1  , 

aummed  over  all  j'e  euch  that  e^  has  both  ends  in  •  ♦  .(  |sf 

denotes  the  cardinality  of  S  .  ) 

Any  vector  x  -  (x^  ]  of  aeroea  and  ones  it  called  the  vector 

of  the  subset  of  e^'s  such  that  x^  •  1  . 

Theorem  2 .  The  vertices  of  polyhedron  are  precisely  the  vectors  gf 
the  aubeets  of  edges  in  C  which  comprise  branch Inga. 
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A  polyhedron  (Cfflgj  ggllbtfesa)  p  !•  the  Mt  of  *11  th*  vector*, 

1.*.,  point* .  which  satisfy  soaa  flnlt*  syataa  L  of  llnsar  inequalities. 

A  vertax  (extren*  point)  of  P  i*  *  point  which,  for  sow*  limar  function, 

1*  th*  unique  point  in  P  which  aaxlaiaas  that  function. 

A  batlc  point  x  »  *°  of  a  finite  syataa  L  of  linear  inequalities 
la  the  unique  aolution  of  a  ayatan,  ^E  a^  ,  )  «J,  such  that 

jE  *lj  *t  S  ^  i  J  '  J  i  11  1  aubayata*  of  L  . 

If  haalc  point  x®  of  L  la  in  th*  polyhedron  P  of  L  ,  than  it  la 

0 

a  vortex  of  P  ,  because  clearly  x  la  than  the  unique  point  in  P  which 
nexinixaa  ^  (^E  a^  )  ,  J  •  J  . 

o 

We  shall  eee  without  difficulty  that  any  point  x  ,  which  la  the  vector 

0  0 
of  a  branching  *ay  >  in  G  ,  la  a  vertex  of  Pg  .  Vector  x  aatlaflea 

Lj  since  it  la  all  aaroaa  and  onaa .  Vector  x°  satisfies  for  any 

nod*  y«  C  ,  since,  by  the  definition  of  branching,  at  scat  one  of  th*  x^'a 

la  this  Inequality  haa  value  1  for  *0  . 

0 

The  branching  B  is  a  forest,  so  any  set  S  of  nodes,  together  with 

0  0 

the  subset  lj  of  the  edges  in  1  which  have  both  ends  in  S  ,  fores  a 

forest.  Th*  nueber  of  edges  in  a  forest  la  at  neat  th*  n unbar  of  aodaa  In  the 

foreat  minus  1  ;  In  particular,  )l^|  <  |tf  -  1  ,  Therefore,  vector  a® 

satisfies  Lj  for  any  subset  S  of  (two  or  nor*)  nodes  in  G  ,  since  jt®{ 

u 

of  the  Xj'a  in  this  inequality  have  the  value  1  for  x  Subbm rising  th* 

0 

conclusion  so  far,  x  la  a  point  in  P„  . 

V 
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Vector  x  i»  the  unique  solution  of  the  llneer  system:  x^  •  0 

for  every  edge  e^  not  in  B°  ,  end  E  x^  -  1  (sumnd  over  e^'s  directed 
toward  v  )  for  every  node  v  which  has  some  edge  of  B®  directed  toward 


it.  This  system  can  be  obtained  from  certain  of  the  relations  of  and 

Uj  by  replacing  their  inequality  signs.  Therefore  x^  is  a  basic  point 

of  L_  .  end  hence  a  vertex  of  P„ 

G  G 

Most  of  this  paper  is  directed  toward  proving: 


Le« s  1:  Every  linear  function,  ^  cj  xj  (  «—d  over  all  edges  e.cG), 
is  maximised  in  by  the  vector  of  soon  branching  in  G 

Prom  Lemma  1  and  from  the  definition  of  vertex,  it  follows  immediately 
that  every  vertex  of  P_  is  the  vector  f  a  branching  in  G  .  This  will 
conclude  the  $r»ol  of  Theorem  2  . 

Q 

A  branch ins  g  in  graph  G  has  maximum  total  weight  relative  to  the 
vector  c  *  [Cj ]  of  edge'Weights  if  and  only  If  the  vector  x®  •  [x®]  of  1® 
maximises  (c,*)  »  ^E  x^  over  all  vectors  of  branchings  in  C  .  If  x° 

maximises  (c,x)  over  ,  then  it  mexialses  (c,x)  over  the  vectors  of 

branchings  in  G  ,  since  the  letter  ere  in  Pg 

Out  teak,  therefore,  la  to  ahow  that  the  vector  of  the  hremching  1®  , 
produced  by  the  algorithm,  maximises  (c,x)  over  P.  .  This  will  prove 

tf 

that  the  algorithm  la  valid  and  will  prove  Lemma  1  . 


*6 

The  following  computations  are  well-known  in  linear  programing. 


Suppose  that 

x  »  [x^]  is  any  vector  which  satisfies 

(5) 

X|  >  0  for  every  £  ,  and 

(6) 

tT  Ui  \  -  bn  for  every  n  ' 

and  that  y  ■ 

[y  1  is  any  vector  which  satisfies 

V 

(7) 

y^  >  0  for  every  r)  ,  and 

(8) 

vz‘tv  yfl-c{  £or'v"y  i  ■ 

Since  (6)  and  (7)  imply 


w  ..’VuiW  *EVti  "  (b'y)  ’ 

and  since  (5)  and  (8)  imply 

(10)  ’  (c'x)  ' 

we  have 


(ID 


<c,x)  <  (b,y)  . 


IS 


0  0 

Since  (11)  holds  for  any  x  end  any  y  ,  if  (c,x  )  *  (b,y  )  ho  Ida 

0  0  0  , 
for  particular  x  ■  x  and  y  •  y  ,  then  x  must  aaxiaiae  (c,x)  end 

0 

y  auat  minimise  (b,y)  . 

Suppose  for  particular  x  ■  x1  and  y  ■  y1  that 

(12)  *  E  a*  xl  »  b  for  tj  such  that  y*  t  0  , 

5  5”  5  v  V 


I!  a  v  1 

v  tv  m  c£  for  4  such  that  t  o 


Since  (12)  implies  equality  in  (9)^  and  (13)  lilies  equality  in 
(10)  ,  we  have  (c,  x1)  •  (b,  yl)  .  Therefore, 


(14)  x  maximises  (c,x)  and 

y^  minimises  (b,y) 

Our  present  interest  is  where  (5)  is  (L^)  ,  and  (6)  is  (Lg)  and  (L^). 

For  any  linear  function  (c,x)  ■  jE  x^  of  points  x  «  ^  ,  hi  fat  a  dual 
system,  (7),  (8),  (b,y),  by  letting  a  variable  y^  correspond  to  each  inequality 
of  Lg  and  That  is  let  a  variable  y^  correspond  to  each  node  s G 

and  let  a  variable  y#  correspond  to  each  set  V  of  two  or  more  nodes  in  G  . 

For  (7)  we  have, 


for  every  ,  y^  >  0  ,  and 


for  every  8  ,  y§  >  0  . 
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Coefficient  -  1  if  edge  a.  i*  directed  toward  node  y.  ,  end 
Jn  j  “ 

a  ■  0  otherwise.  Coefficient  a  *1  if  edge  e.  has  both  ends  in  S  , 
Jh  J*  J 

and  a^#  -  0  otherwise.  For  every  ,  b^  -  1  •  *or  •v#rF  s  » 

k,  -  M-l  • 

Therefore ,  (8)  becoaes 


(17) 


for  every  edge  e^sG  , 


Function  (b,y)  becoaes 


y.  +  w,  >  c  ,  where  y.  is  the  front-end 
h  )  -  J  h 

of  e^  ,  and  where  w^  ■  E  y§  ,  sunned  over 
all  sets  S  which  contain  both  amis  of  e 


J  ‘ 


o>,y>  -  hs  yh  +  .s  <isl-1>y«  * 

sunned  over  all  |/^  and  over  all  S 

Recall  that  our  task  is  to  show  that  the  vector  *°  of  the  branching 
1°  ,  produced  by  the  algorltha,  aaxlaises  (c,x)  over  Pg  . 

In  view  of  (14),  we  do  so  by  constructing  a  vector  y  •  ly^,  y#]  which 
satisfies  (15),  (16),  (17),  and  which  satisfies  (12)  and  (13).  For  the  pte- 
eent  systen,  (12)  is 

(18)  for  every  node  y^  such  that  yh  4  0  , 

•  1  ,  sunns d  over  J’s  such  that 
is  directed  toward  J  and 
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(19) 


for  every  set  S  ouch  that  yg  ?■  0  ,  E  x®  ■  js|-l  , 
•tuned  over  J's  such  that  e^  has  both  ends  in  S  . 


In  other  words,  (18)  says  that  if  y^  ^  0  than  m  edge  of  the  branch¬ 
ing  B°  is  directed  toward  y^  ,  and  (19)  says  that  if  jg  t  0  then  exactly 
| S | — 1  edges  of  B®  have  both  ends  in  S  . 


For  the  present  systea,  (13)  is 


(20) 


0 

for  every  edge  e^  in  the  branching  B  , 

yfa  +  Wj  ■  Cj  ,  where  y^  and  w^  are  as  in  (17). 


Si 


For  each  graph  Gi(i  -  k,  k-l,...,0)  with  weight  cj  on  each  edge 

ej  «G*  ,  and  for  the  branching  B*  in  G1  ,  we  will  describe  a  vector 

y1  which  satisfies  (15)  -  (20)  ,  where  G  and  B°  are  replaced  by  G1  and 

B1  and  where  vector  y  is  yl  . 

k  i+1 

First  we  describe  a  y  ,  and  then,  earning  ay  (i  ■  k-l,...,0),  we 
describe  a  y1  .  Thus  by  induction  we  obtain  a  y  ■  y®  and  the  proof  of 
Theorewe  1  and  2  . 

k  k  k  k 

The  vector  y  “  (y^  »  y$]  ie  y  •  0  for  every  set  S  of  two  or  aore 

nodes  In  Gk  ,  yf  ■  0  for  every  node  y*  1°  G*  which  has  no  edge  of  B*1 
o  b 

k  k  k  k 

directed  toward  it,  and,  for  every  other  node  y^  in  G  ,  y^  *  c^  where 
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V  k  k 

edge  e^  of  B  la  directed  toward  y^ 


Conditions  (IS)  -  (20)  for 


v  1c  lc 

y  can  be  iaaediately  verified  from  the  fact  that  for  every  node  ^  e  G 

k  k 

either  there  is  no  edge  of  B  directed  toward  y  ^  and  there  is  no  posi- 

k 

tlvely  weighted  edge  directed  toward  y£  ,  or  else,  aaong  all  the  poaitlvely 

k  k 

weighted  edges  directed  toward  ^  ,  the  one  in  B  has  oaxinuB  weight. 

Now,  suppose  that  we  have  a  y*+*  for  each  node  y^1  and  a  y*+* 


for  each  set  S  of  two  or  sore  nodes  in  G 


i+1 


,  such  that  (15)  -  (20)  are 


satisfied  (where  B®  is  replaced  by  B*+*  ,  etc.). 


t  *  .1+1  -  1+1 

Ut  C  h  "  E  ys 


d  over  the  sets  S  which  contain  node 


i+1 


To  make  the  induct  ion  go  through  we  assuae  further  that  in  G 


i+1 


(21) 


for  every  node  y^  ,  such  that  tfe  +  yh  >  0  , 
exists  at  least  one  edge  e^  directed  toward 
such  that  c .  -  t^  +  y^  . 


there 

^h 


k 

This  clearly  holds  for  G  ,  and  we  will  prove  froa  (15)  -  (21)  for 
Gi+l  that  (15)  -  (21)  holds  for  G1  . 

Obtain  the  vector  yl  as  follows. 

Where  A  is  the  set  of  nodes  in  circuit  Q*  of  G*-  ,  where  e^  1* 
the  edge  of  Q*  not  in  B1  ,  where  y^  is  the  front  end  of  e^  ,  where 
Cq  is  the  ainlaua  weight  In  Q1  ,  and  where  y^+1  is  the  node  in  Gl+1 
to  which  Q*  was  shrunk,  i.  . 


(22) 


,  and 
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(23) 


1  1  „i+l 

c2  -  y2  -  tx 


Where  v\,  ie  eny  node  in  A  other  then  y^  ,  end  where  e^  is  the 
edge  in  Q*  which  ie  directed  toward  y*  ,  let 


(24) 


i  i  i  ..i+1 
^3  *  C4  ‘  yA  ’  ‘i 


Obeerve  that  (24)  holds  also  for  v2  m  v2  * 

Where  y*  is  any  node  of  G*  which  is  not  in  ,  let 


(25) 


i+1 


Where  R  is  a  non  empty  subset  of  nodes  in  G1+1  which  does  not 
contain  yj+*  ,  where  J  ■  RUyJ+*  ,  where  K  **  RUA  ,  and  where  L  is 
eny  set  of  two  or  more  nodes  in  G*  such  that  i/lA  is  e  proper  subset  of 
A  ,  let 


(26) 

(27) 

(28) 


i+1 


l+i  . 

yT  ,  •«* 


K  "  0  * 


Thet  completes  the  description  of  vector  y  .  Mow  we  must  verify  (15) 


(21)  for  it 
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For  every  edge  of  G  which  ia  dlractad  toward  a  noda  not  in  A  , 
for  avary  noda  not  In  A  ,  and  for  avary  aat  8  ,  except  A  f  in  G*  , 
conditions  (15)  -  (18) ,  (20),  and  (21)  follow  leaadlataly  froa  thoaa  aaaa 
conditions  for  yt+*  ,  (25)  -  (28),  and  tha  local  nature  of  tha  change  froa 


.i+l 


.i+1 


,  and  c 


i+l 


co  0 


B  ,  and  c 


For  avary  subset  of  nodes  in  G  which  does  not  contain  all  of  A  , 
condition  (19)  follows  Imadlataly  as  above.  For  aat  A  and  for  avary  sat 
K  as  in  (27),  condition  (19)  follows  froa  (27),  condition  (19)  for  sat  J 
in  ,  and  tha  fact  that  there  are  exactly  |k|  -  |j|  ■  Ja|  -  1  wore 

edges  of  B*  with  both  ends  in  K  than  there  are  adgea  of  B*+*  with  both 
ends  in  J  ,  namely  tha  edges  of  8*  A  Q1, 

It  follows  from  (24),  (27),  and  (28),  that  (21)  holds  for  avary  noda 
h  A  (in  particular  where  a^  is  tha  a*  of  (24)  ),  and  that  (20)  holds 

for  avary  edge  cf  B*  fl  Q1 ,  and  that  (17)  holds  for  e*. 

Condition  (18)  follows  iamsdiately  for  each  noda  of  A  except  since 
there  is  an  edge  of  B1  A  Q*  dlractad  toward  it.  If  there  la  an  edge  e*+* 
in  B1+*  which  is  directed  toward  ,  than  a*  la  an  edge  of  B*  which 

is  directed  toward  ,  and  so  in  this  case  (18)  follows  for  .  Other¬ 
wise,  if  there  Is  no  edge  of  8 1+1  dlractad  toward  ,  than  by  (18)  for 

,  yj“+*  ■  0  .  Also  in  this  case,  tha  c^  of  (22)  was  chosen  in  tha 

algorithm  to  be  c .  Therefore,  if  there  is  no  edge  of  B1*^  directed 
toward  ,  then  (22)  is  y^  -  0  ,  and  so  (18)  follows  for  . 
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For  a*  ,  tha  only  edge.  If  any,  which  la  In  B*  -  Q and  dlracCad 
Coward  a  noda  In  A  ,  wa  hava  c*+1-  -  c*  +  c*  -  (fron  (1)),  (22), 
y*+1+  w*+1  -  c[+1  which  la  (20)  for  a*+*  >  and  w*  •  w^+*  from  (27)  and 
(28)  .  Combining  Chaaa  wa  get  +  W1  "  cl  >  (20)  for  a*  . 

Thu«  condition*  (18),  (19),  (20),  and  (21)  arn  now  complataly  accountad 
for.  Condition  (17)  for  adgaa  not  In  Q1  but  diractad  toward  nodaa  in  A  , 
condition  (16)  for  y*  ,  and  condition  (15)  for  nodaa  In  A  ,  remain  to  ba 
verified. 

Lat  a*  ba  any  adga  of  Gl  which  haa  both  anda  In  A  ,  and  lat  »»* 

ba  lta  front  and.  To  prova  (17)  for  a^  ,  which  la  y*  +  w*  >  c*  whara 
w*  -  y*  +  t*+1  ,  combine  (24)  and  c^  >  c*  . 

Lat  a*  ba  any  adga  of  G*  which  haa  lta  front  and  ^  In  A  and 

lta  mr  and  not  in  A  .  To  prova  condition  (17)  for  a*  > which  la 

yj  wj  >  c*  whara  w*  -  w*+1  ,  combine  (24),  (23),  (22),  (l)(and  (17)  for 
1+1 

*3  * 

To  prova  (16)  for  A  ,  that  la  y^  >  0  ,  wa  uaa  (21)  for  . 

Aaaunlng  t**1  +  >  0  ,  lat  e*+i  ba  tha  a^  of  that  ralatlon,  lat  Vj 

ba  tha  front  and  of  a*  In  A  ,  and  lat  a^  ba  tha  adga  of  Ql  which  la 

diractad  toward  y*  .  Kara  (21)  la  e*>l  -  t**1  +  y**1  .  In  Chla  caaa, 

obtain  yA  £  0  by  combining  (23),  (22),  (21)  for  v\*1  ,  (I),  and  (4)  . 
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If  thara  la  to  a*  diraetad  toward  auch  that 

c*+1  -  t^+1  +  y*+l  ,  than  t*+1  +  y^+1  -  0  ,  and  all  adgaa  diraetad  toward 

hava  negative  weight  In  6*^  ,  ao  nona  of  thaa  ara  In  ,  Thara - 

fora  a Inc a  In  thla  caaa  tha  e^  of  (22)  waa  choaan  to  ba  c^  ,  (22)  bacoaMa 

yj  -  0  ,  and  (23)  bacoaaa  yjj  -  ej  .  By  (2),  wa  hava  yjj  0  . 

Prova  (15)  for  any  noda  In  A  by  combining  (24),  (23),  (22),  (3) 

and  y*+*>  0  • 

That  coaplata*  tha  proof  of  Thaoraaa  1  and  2  . 

|8 

Hot  lea  froa  tha  proof  that  If  avary  weight  ,  a^*  0  ,  la  an  lntagar, 

than  tha  vactor  y°  ,  aa  wall  aa  vector  *°  ,  la  Intagar-valuad.  In  partlcu- 

0 

lay,  where  avary  ■  l  ,  vector  y  la  0,1  -  valued  and  awx(c,x)  ■  nln(b,y) 
la  a  a  lap  la  "itonlg-typa"  thaorau,  uialogoua  to  tha  war  1im-c  ordinal  It  y-aatchlng 
duality  tbiawa  to  [1]. 

Tha  following  two  thaoraaa  can  ba  proved  by  tha  aathoda  need  bare . 

Thaaraa  3  .  Watt  (L4)  U  E  a,  -  n  ,  83S3L  i ill  ifctti  •  •  # 

jstat  XMLLk&iA  slL  *ul  nlitoAti*  Una  ix  <li)»a<^ '3> •  (V  mU*Ux 

tha  vac  tor  a  of  tha  a -cardinality  lubaait  of  1b  0  whleh  tatlH 

branehlnaa.  (^  aaitlCttlgg,  f|tf|  >  ifOL  iftli  Utt  tit.  M*K  1 L  M tfeft 
ti  0  #  that  feMSkiill  KJL  til  ftteRtil  KtglMttHI  fil  0  >  • 
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The  present  research  began  when  A.J.  Goldman  asked  for  a  description 
of  "the  convex  hull  of  the  spanning  trees  of  a  gr^ph."  Theorem  4  la  proved 
in  [3], 

Theorem  4  .  The  vertices  of  the  polyhedron  F^  given  by  (L^)  and  (L^) 
are  precisely  the  vectora  of  the  subseta  of  edges  In  G  which  comprise 
forests .  The  vert  Ices  of  the  Intersection  of  F^  with  (L^)  are  a  eubaet  of 
Che  vertices  of  F_ 
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I.  The  Optimum  Assignment  Problem 

For  rectangular  array  (matrix),  N,  ve  define  a  matching 
in  N  to  be  a  subset  M  of  the  positions  in  N  such  that  each 
column  and  each  row  of  N  contains  at  most  one  member  of  M  . 

For  any  square  array  N  ,  we  define  a  transversal  or  a  perfect 
matching  M  to  be  a  subset  of  the  positions  in  N  such  that  each 
column  and  each  row  of  N  contains  exactly  one  member  of  M  , 

The  optimum  assignment  problem  is, 
given  any  n  x  n  array  N  of  real 
numbers,  find  in  N  a  transversal  the 
sum  of  whose  entries  is  maximum,  i.e., 
an  "optimum"  transversal. 

A  transversal  in  N  "assigns"  the  rows  of  N  to  the  columns 
of  N  .  Where  the  columns  are  people  and  the  rows  are  jobs,  and 
where  each  numerical  entry  represents  the  value  of  the  person  of  that 
column  at  the  Job  of  that  row,  an  optimum  transversal  represents 
an  optimum  assignment  of  the  people  to  the  Jobs. 


A  well-known  generalization  of  the  assignment  problem  Is 
the  Integer  Hitchcock -transportation  problem:  Given  a  rectangular 
array  H  of  real  numbers,  c^  ,  and  given  an  Integer  a^^  >  0 
for  each  row  1  and  an  Integer  b^  >  o  for  each  column  J  ,  assign 
a  non-negative  Integer  x^  to  each  position  (l,j)  so  that 

(1)  for  every  1  ,  -  a1  , 

(2)  for  every  J  ,  =  b^  , 

and  so  that  ^ Jc^x^  Ininimum  (or  maximum). 

If  represents  the  number  of  refrigerators  available  at 

factory  1  ,  and  b^  represents  the  number  of  refrigerators 

ordered  by  dealer  J  ,  and  c^  represents  the  cost  of  shipping  a 

refrigerator  from  1  to  J  ,  then  ,  JJ:. ,x  ,  represents  the 

lyj  lj  1J 

total  coat  of  the  particular  manner  [x^]  of  distributing  the 
refrigerators . 

The  assignment  problem  is  where  all  aj  -  1  and  all  bj  -  1  . 

A  minor  variation  of  the  assignment  problem  Is:  given  a 
rectangular  array  N  of  real  numbers  find  in  H  a  matching  whose 
entries  have  maximum  sum.  This  variation  corresponds  to  replacing 
the  equality  signs  in  (1/  and  (2)  by  Inequality  signs.  Ofcourse, 
a  maximum  matching  will  not  contain  a  position  whose  entry  Is 
negative.  In  particular  if  all  the  entries  are  negative  then  the 
maximum  matching  will  be  the  empty  matching.  It  is  an  Interesting 
exercise  to  discover  how  any  maximum  transversal  problem  can  be  solved 
by  solving  a  maximum  matching  problem,  and  vice  versa. 
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There  are  other  generalizations  and  variations  of  the  optimum 
assignment  problem  —  most  notably  integer  network  flow  problems. 

These  lectures,  after  treating  the  assignment  problem  itself  will 
deal  (briefly,  I’m  afraid)  with  some  bizarre  variations. 

In  an  n  x  n  array  there  are  n!  different  transversals.  In 
particular  there  100  J  ways  to  assign  100  people  to  100  Jobs. 

100  J  is  very  large.  If  our  method  for  finding  an  optimum  assign¬ 
ment  spent  one  microsecond  per  possible  assignment,  it  would  take 
hundreds  of  years  to  cptimally  assign  100  men. 

It  is  a  remarkable  fact  there  exists  a  consistently  good 
algorithm.  An  algorithm  good  enough  that  you  could  actually  do  sb 
homework  any  instance  of  the  assignment  problem  with  100  people,  100  Jobs, 
and  any  collection  of  3  digit  numbers  as  values.  Good  enough  to  be 
used  for  many  thousands  of  people  and  Jobs.  Of course  you  have  to 
know  how,  and  it  is  not  easy  to  discover  how. 

It  is  an  unfortunate  fact  for  most  combinatorial  problems  — 
problems  very  similar  to  the  assignment  problem  --  that  good  algorithms 
are  not  known.  For  most  such  problems,  though  they  are  of course 
finite,  the  best  known  methods  do  considerably  worse  than  one  might 
expect.  Such  problems  include  the  bulk  of  Integer  linear  programming 
problems . 

Therefore  we  do  what  we  can.  We  experiment  with  tht  most 
promising  methods  we  can  find  for  problems  that  need  answers.  And 
we  also  try  to  find  classes  of  problems  for  which,  using  special 
methods, we  can  predict  computational  efficiency.  These  lectures  fall 
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nto  the  latter  area. 


For  ease  in  giving  an  arm-waving  description  of  an  algorithm, 
ii  is  convenient  to  represent  the  assignment  problem  by  a  graph. 

A  bipartite  graph  G  is  one  whose  set  V  of  nodes  partitions 
into  two  sets  and  Vg  so  that  no  edge  of  G  has  both  ends  in 

the  same  set.  Thus,  each  edge  meets  one  node  in  and  one  node 

in  Vg  .  Denote  the  set  of  edges  of  G  as  E  . 

A  matching  M  in  a  graph 
is  subset  of  its  edges  such  that 
no  two  members  meet  the  same  node. 

A  perfect  matching  in  a  graph  is  a  subset  of  its  edges  such  that 
exactly  one  member  meets  each  node. 

The  optimum  assignment  problem  (more  precisely,  a  minor 
generalisation  of  it)  is: 

In  any  given  bipartite  G  ,  with  a  real  numerical  weight  cft 

for  each  edge  e*E  ,  find  if  there  is  one  a  perfect  matching  M 

which  maximises  £  c  »!•«•"«  maximum  perfect  matching". 
e<M  e 

After  we  treat  the  above  problem,  we  shall  treat  the  same 
problem  where  G  is  not  necessarily  bipartite.  The  latter  is  a 
very  substantial  generalisation. 

Where  0  is  bipartite,  with  "parts"  and  Vg  ,  the  nodes 
of  V^  correspond  to  rows  or  Jobs  the  nodes  of  Vg  correspond  to 
columns  or  people.  Each  edge  corresponds  to  a  position  in  the  array. 


-  1*  - 


Any  perfect  matching  in  G  determines  an  assignment  of  people  to 
jobs,  as  does  a  perfect  matching  in  the  array. 


Where  G  is  any  graph  not  necessarily  bipurtite,  the  maximum 
perfect  matching  problem  is  the  problem  of  optimally  pairing-off 
a  set  of  objects  (the  nodes) .  Admissible  pairs  and  their  values 
are  represented  by  the  edges. 

We  shall  see  later  that  various  other  problems  reduce  to  the 
matching  problem. 

A  feasible  node -weighting  of  graph  G  is  a  vector  [y  ] 
with  a  component  yy  for  each  node  v^V  such  that,  for  every 
edge  efE  , 

(3)  y  +  y  >  c  where  u  and  w  are  the  ends  of  e  . 

•'u  v  -  e 

Lemma  1.  For  any  perfect  matching  M  of  a  graph  G  and  for  any 
feasible  node -weighting  [y  ]  , 


(u)  Z  c  <  £  y  . 
ecM  VfV 


Proof:  Add  up  the  inequalities  (3)  which  correspond  to  the  edges 
in  M  . 


It  follows  from  Lemma  1  that  if  ve  can  find  a  perfect  matching  M 
and  a  feasible  fyv]  such  that  equality  holds  in  (4),  then 

£  c  must  be  maximum  and  £  y  must  be  minimum. 
e{M  6  V(V  V 


This  is  true  Aether  or  not  0  is  bipartite.  When  one  can 
get  such  a  node -weighting  along  with  an  M  ,  it  provides  a  very 
simple  guarantee  thst  the  M  is  maximum. 


If  0  is  not  bipartite,  there  does  not  necessarily  exist  a 

feasible  [y  ]  such  that  jy  equals  the  maximum  value  of  j  c 
v  T  efM  * 


For  any  given  edge-weighted  bipartite  graph  G  ,  the  optimum 
assignment  algorithm,  that  we  will  describe,  first  chooses  any 
feasible  node -weighting  [y  ]  .  Then  it  chooses  a  matching,  not 
necessarily  perfect.  Then  it  successively  finds  better  node- 
welghtlngs  and  better  matchings  until 

(l)  it  finds  a  node -weighting  and  a  perfect  matching  for  which 

E  c„  ■  E  yv  i  0T  until 
e<M  VfV  v 


(2) 


it  finds  a  way  to  choose  node -weightings  such  that 
(in  which  case,  by  Lemma  1,  there  is  no  perfect  os 


tching  in 


0). 


Thus,  the  algorithm  will  prove 

Theorem  2.  For  any  edge -weighted  bipartite  graph  G  ,  which  contains 
a  perfect  matching,  the  maximum  welght-sia  of  a  perfect  matching  in  0 
the  minimum  sum  of  a  feasible  node -weighting. 


equals 
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P.S.  1  If  G  contains  no  perfect  matching,  then  there  is  no  minimus 


feasible  node-weight  sum. 

By  using  only  integer  node-weights,  if  edge-weights  are 
integers,  the  algorithm  will  also  prove: 


P.S.  2  If  the  edge -weights  are  all  integers,  then  a  min  node 
weighting  can  be  chosen  to  be  integers.  a'/x 

The  two  postscript  as  well  as 
the  theorem,  are  false  for  non- 
bipartite  graphs . 
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II.  A  Hungarian  Method 


We  back  up  nov  and  state  Theorem  1,  vhich  will  be  uBed  as 
part  of  the  optimum  assignment,  algorithm,  to  prove  Theorem  2 
already  given. 

A  subset  H  of  the  nodes  in  a  graph  G  is  called  hungarlan, 
relative  to  G  ,  if  no  two  of  them  are  Joined  by  an  edge  of  G  and 
if  the  set  H(H)  f  neighbors  of  H  has  fewer  members  than  H  , 
i.e.,  |If ( H)  j  <  j H |  A  neighbor  of  H  is  a  node  which  is  Joined 
by  an  edge  of  G  to  a  node  in  H  . 


Obviously,  a  graph  which  contains  a  Hungarian  set  H  can  not 
Contain  a  perfect  matching  M  ,  because  the  | H J  leabers  of  M 
which  aeet  the  nodes  In  H  would  have  to  meet  j H {  different 
aeabera  of  H(H)  .  This  is  impossible  since  |H ( H) !  <.  |H|  . 

A  non-bipart ite  graph  does  not  necessarily  contain  either  a 
hungarlan  set  or  a  perfect  matching. 

Theorem  1.  A  bipartite  graph  G  contains  nc  perfect  matching  if 
and  only  if  0  has  a  hungarlan  set,  contained  either  in  part 
or  part  V?  cf  G  . 
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Subroutine  R1  of  the  algorithm  will  prove  Thm  1  by  finding 
in  any  bipartite  graph  either  a  perfect  matching  or  a  hungarian 
set.  Subroutine  R2  of  the  algorithm  will  prove  Thm  2  by  using 
a  hungarian  set  of  a  subgraph  of  G  to  improve  any  non-minimum 
node -weighting  of  G  . 

Here  is  the  algorithm,  Given  a  bipartite  graph  G  with  a 
numerical  weight  ca  on  each  edge  e(E  . 

Give  to  G  any  feasible  node -weighting  fyv]  •  (Make  the 
node-weights  integers  if  the  edge-weights  are.) 

For  the  current  feasible  node -weighting  at  any  stage  of  the 
algorithm,  let  G'  denote  the  subgraph  of  G  which  consists  of 
all  nodes  of  G  and  those  edges  e  of  G  such  that 

(5)  y  +  y  =  c  ,  where  u  and  w  are  the  ends  of  e  . 
u  v  e 

We  call  G'  the  equal ity  subgraph  of  G  relative  to  fy  ]  ■ 

Using  routine  Rl,  we  find  either  a  perfect  :.vatching  M  in  0’ 
or  else  a  hungarian  8et  H’  of  G '  in  either  part  or  V?  . 

(Since  every  edge  of  G  has  or.e  of  its  end -nodes  in  set  and 

its  other  end -nod*  in  set  Vg  ,  the  same  is  true  for  subgraph  G'  . 
Any  perfect  matching  of  G'  is  of course  also  a  perfect  matching 
of  G  ,  since  G*  contains  all  th*  nodes  of  G  .  However,  s 
Hungarian  set  H'  of  G'  la  not  in  general  a  hungarian  set  of  G  . 
Since  G  generally  has  more  edges  than  G'  ,  set  H'  of  nodes 
generally  has  more  neighbors  relative  to  G  than  relative  to  GV.) 
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Theorem  1  saye  that  O'  has  either  an  M  or  an  H'.  Routine  R1 
will  be  the  proof  of  Theorem  1. 

Suppose  we  find  &  perfect  matching  M  in  O'  .  Then  adding 

together  equations  (?),  y  +  yy  =  cg  ,  for  the  edges  e  in  M  , 

we  get  £  c  *  £  y  ,  end  so  M  is  optimum. 
e*M  e  V(V  v 

Otherwise,  we  find  a  hungarian  et  Y‘  of  G'  contained  in, 
say,  part  .  In  thiB  car  ,  we  epply  routine  R2: 

R2.  Suppose  that  the  set  K(H')  ,  of  neighbors  of  H’  relative  to 
G’,  is  not  the  entire  set  of  neighbors  of  H1  relative  to  G  .  Then 
there  are  edges  e  of  G  which  are  not  in  G’  and  which  have  one 
end  in  H’  and  the  other  end  not  in  N(H’)  .  Let  c  =  min  (yu+yv  - 
over  all  such  edges.  Clearly,  (  >  0  .  Lowering  each  node -weight  in 
H1  by  e  and  raising  each  node -weight  in  H(H')  by  e  ,  we  get  a 
new  feasible  node -weighting.  Its  sum  is  smaller  because  (N(H')J  <  |K| 
The  equality  subgraph  G'  changes,  so  we  apply  R1  again. 


Where  H(H')  is  the  entire  set  of  neighbors  of  H',  relative 
to  G  as  well  as  G',  the  set  H'  is  hungarian  relative  to  G  and 
so  there  is  no  perfect  matching  in  G  .  In  this  case,  we  can  take  e 


to  be  as  large  as  ve  please,  and  still  change  node -weights  as  above. 
This  gives  as  small  as  we  please,  i.e.  . 

Assuming  Thm  1  holds,  i.e.,  assuming  there  is  a  valid  routine 
Rl,  the  routine  R2  just  described  proves  Thm  2  and  PS1.  For  a 
node -weighting  having  min  sum,  there  must  be  a  perfect  matching 
in  the  corresponding  G'  ,  because  otherwise  ve  can  apply  Thm  1 
and  R2  to  get  a  smaller  node -weight  sum. 

P5£  asserted  that  if  edge -weights  are  integers  then  a  min 
node -weighting  can  be  chosen  to  be  integers.  This  follows  by 
applying  the  above  process  to  any  integer-valued  feasible  node- 
weighting,  because  whenever  edge-weights  and  node-weights  are 
integers.  <  1r.  an  integer  (or  arbitrary  -  «  ) . 

Let  us  new  describe  routine  Rl  ,  thereby  proving  Thm  1  and 
completing  the  description  of  an  optimum -assignment  algorithm.  KL 
must  find  either  a  perfect  matching  in  G’  or  a  hungarien  set  of  G' 

Rl;  Let  M  be  any  matching  in  ?'  ,  not  necessarily  a  perfect 
matching.  To  begin  it  might  be  the  empty  matching.  If  M  is  not 
perfect,  let  r  be  any  node  which  M  does  not  meet.  We  shall  find 
either  a  hungarian  set  of  G'  which  contains  r  ,  or  else  a  matching 
M'  in  O'  which  is  better  than  M  in  the  sense  that  K'  meets 
r  as  well  as  all  the  nodes  which  M'  meets. 
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In  O'  ,  "grow  a  tree"  T  of  the  kind  pictured  inside  the 

bgan 

-  erect.  Node  r  itseli  a  tree  of  type  T  .  Start  off  with 


simply  it. 


edge  of  O'  ,  not  in  T  ,  and 
not  in  M  . 
edge  of  M  . 

edge  of  T  ,  not  in  M  . 
called  an  inner  node  of  T  . 
called  an  outer  node  of  T  . 


All  the  outer  nodes  of  T  must  he  in  the  sane  part  of  O'  . 

Thus,  no  two  at  them  are  Joined  by  a  edge  of  O'  .  The  number  of 
inner  nodes  of  T  is  exactly  one  less  than  the  number  of  outer  nodes. 


Por  any  T  either  (a),  (b),  or  (c)  holds,  (a)  Every  edge  in  O' 
which  meets  an  cuter  node  of  T  meets  an  inner  node  of  T  at  its 
other  end.  In  this  case,  the  set  of  outer  nodes  is  a  hungarian  set  B 
of  O'  .  The  set  of  inner  nodes  is  N(H)  ,  relative  to  O'  . 
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Otherwise  sane  edge  e^  of  G'  meets  an  outer  node,  say  vq  , 

of  T  and  a  node,  say  u  ,  not  in  T  .  (b)  u  meets  no  edge  in  M  . 

In  this  case  obtain  M'  fran  M  by  interchanging  the  matching 

roles  of  edges  in  the  path  P  ,  consisting  of  u  ,  eQ  ,  and  the 

path  from  vq  to  r  in  T  .  Then  forget  T  and,  if  there  is 

still  some  node,  say  r'  ,  in  G'  which  does  not  meet  any  edge 

of  M'  ,  grow  in  G’  another  tree  rooted  at  r'  .  (c)  Otherwise, 

u  meets  an  edge  e^fM  .  In  this  case,  enlarge  T  by  adjoining 

e  and  e,  to  it,  so  that  u  becomes  an  inner  node  and  the 
o  1 

other  end  of  becomes  an  outer  node. 

Suppose  the  number  of  nodes  in  G  ,  and  thus  in  G'  ,  is  2n  . 
After  T  is  enlarged  at  moat  n  times,  either  (a)  or  (b)  must  occur. 
After  at  most  n  differently-rooted  trees  are  grown,  either  (a) 
or  a  perfect  matching  must  occur  in  O’  . 

When  (a)  occurs,  we  apply  R?  to  the  hungarian  set  H  of  G’  , 

the  set  of  outer  nodes  in  T  .  Thus,  unless  H  is  also  a  hungarian 

set  relative  to  G  ,  the  node  weights  change  and  the  equality 

subgraph  G'  changes.  The  outer  nodes  of  T  are  then  not  a 

hungarian  set  of  the  new  O'  .  However,  the  same  matching  M  is 

contained  in  the  new  G '  and  the  sane  tree  T  is  contained  in  the 

new  G'  .  Case  (b)  or  (c)  of  R1  applies  directly  to  this  M  and 

T  in  the  new  G'  ,  and  so  we  can  continue  growing  the  same  tree  in 

2 

the  new  G*.  Thus,  HI  is  iterated  s  total  of  at  most  n  times 

and  the  "step"  of  enlarging  a  tree  is  iterated  a  total  of  at  moet 

2 

n  times  before  one  obtains  either  a  perfect  matching  in  some  equality 
subgraph  G'  or  else  a  hungarian  set  of  G  . 
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III.  Bipartite  Matching  and  Linear  Programs 


By  polyhedron  (strictly  speaking  convex  polyhedron)  ve  mean  the 
set  of  all  vectors  (points)  which  satisfy  some  given  finite  collection 
of  linear  equations  and  linear  inequalities . 

Let  0  be  any  finite  graph,  not  necessarily  bipartite.  Let  B 
denote  the  set  of  edges  of  graph  0  .  Let  V  denote  the  set  of  nodes 
of  0  . 


Let  there  be  a  real  variable  xfi  for  each  edge  efE  .  Let  PQ 
be  the  polyhedron  of  vectors  [xft]  such  that 


(1) 

for 

every 

e«E 

,  xg  ^  0  ,  and 

(2) 

for 

every 

VCV 

,  t  i  =  1  ,  where  the  sum  is  taken  over 
© 

all  edges  e  which  meet  node  v  . 

Another  way  of  expressing  (l)  and  (2)  is 
(1 *)  >  0  »  Jf*  >  and 

(2’)  j I  ajjXj  -  »  Jf*  >  lfV  , 

where  all  b^  »  1  ,  where  a^  -  1  if  edge  J  meets  node  1  ,  and 

where  a  »  0  If  edge  J  does  not  meet  node  1  . 

*•  J 

Matrix  [a^j]  Is  called  the  incidence  matrix  of  graph  G  .  It  has 
a  "row”  1  for  each  node  i«V  and  a  "column"  J  for  each  edge  . 

Clearly,  a  matrix  is  the  incidence  matrix  of  same  graph  if  and  only  if 
each  of  its  columns  contains  exactly  two  l's  and  the  rest  C's. 
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We  may  define  a  vertex  x°  of  a  polyhedron  P  to  be  a  point 

(i.e.,  a  vector  [x®]  =  x°)  in  P  such  that,  for  some  linear 
J 

o 

function  U  =  ,£c  x  of  the  points  in  P  ,  x  is  the  only  point 
J  J  J 

in  P  which  maximizes  the  function. 

It  is  a  standard  theorem  that  if  a  given  linear  function  of 
points  in  polyhedron  P  has  a  maximum  (i.e.,  P  is  not  empty  and 
the  function  is  not  unbounded  above),  then  the  function  is  maximized 
by  a  vertex  of  P  .  Ofcourse,  some  linear  functions  on  P  are 
maximized  by  other  points  as  well. 

As  you  know,  the  linear  programming  problem  is:  For  the  polyhedron  P  , 
determined  by  a  given  system  of  linear  "constraints",  and  for  a  given 
linear  function  U  of  points  in  P  ,  find  a  vertex  of  P  which  maximizes 
(or  minimizes)  U  in  P  . 

Any  vector  [xe]  of  zeroes  and  ones  is  called  the  incidence  vector 
of  (or  simply  the  vector  of)  the  subset  of  e's  such  that  xg  *  1  . 

Thus,  every  subset  of  edges  in  G  is  represented  by  a  unique  0,1 
vector,  and  conversely. 

Clearly,  where  c^  is  the  weight  on  edge  e  in  G  ,  the  weight -sum 
of  any  perfect  matching  M  in  G  is  the  value  of  («<E)  fo r  the 

vector  of  M  . 

Clearly,  the  0,1 -valued  vectors  contained  in  P_  (in  fact,  the 

G 

integer-valued  vectors  contained  in  P^)  are  precisely  the  vectors  of 
perfect  matchings  in  G  . 


We  shall  show  that  the  assignment  problem  is  an  Instance  of 
linear  programming  by  shoving  that 

Theorem  3«  If  0  Is  bipartite,  then  the  vertices  of  P^  are  precisely 
the  vectors  of  perfect  matchings  In  0  . 

Clearly,  even  if  0  is  not  bipartite,  the  vector  of  any  perfect 
matching  M  in  G  is  a  vertex  of  PQ  ,  since  ve  can  display  a 
function  TPexe  which  obviously  is  maximised  in  P^  only  by  the  vector 
of  M  .  In  particular,  vhere  ce  =  I  If  and  only  if  • 

The  hard  part  is  to  shov  that  every  vertex  of  Pq  is  the  vector  of 
a  matching.  We  shall  do  so  using  the  duality  thm  of  linear  programming 
and  thm  t.  about  node -weightings  for  an  edge -weighted  bipartite  0  . 

The  i-p .  dual  of  maximising  U  *  jjjc^Xj  ,  subject  to  x^  >  0  and 
-  \>i  ,  is  minimising  W  -  1Jj»ly1  subject  to  >  Cj  • 

The  duality  thm  says  that  max  U  *  min  V  if  these  extrema  exist. 

In  particular,  the  dual  of  maximising  B- jc  x  (e«S)  in  P_  is 
minimising  W  -  (v*V)  subject  to  yu+yw  >  cg  for  every  e  ,  vhere 

u  and  v  are  the  end -nodes  of  e  .  That  is,  minimising  the  sum  of 
feasible  node -weights ,  as  in  Thm  2. 

Thus,  it  follows  from  Thm  2  and  the  4. p.  duality  thm  that  if  0 
is  bipartite,  then,  for  any  U  ,  the  max  of  U  for  vectors  in  Pq  equals 
the  max  of  U  for  vectors  of  perfect  matchings. 
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Therefore,  since  all  vectors  of  perfect  matchings  are  in  PQ  , 

U  is  maximized  in  P„  by  the  vector  of  a  perfect  matching. 

yj 

For  any  vertex  x°  of  P^  ,  suppose  that  U  is  a  linear  function 
that  is  maximized  in  ?„  only  at  x°  .  Then  x°  must  be  the  vector 
of  a  perfect  matching.  So  Thm  3  is  proved. 

Conversely,  Thm  3  and  the  £.p.  duality  thm  immediately  imply  Thm  2. 

So,  in  view  of  /.p.  duality,  Thm  2  and  Thm  3  are  equivalent. 

Where  bipartite  graph  G  is  a  square  array  whose  rows  i  and 
columns  j  are  the  nodes,  and  whose  positions  (i,j)  are  the  edges, 

Thm  3  is  well-known  as  G.  Birkhoff's  theorem  on  "doubly  stochastic 
matrices"  (19^6) 

An  n  by  n  doubly  stochastic  matrix  is  defined  to  be  an  n  by  n  matrix 
[x1j]  such  that: 

all  x^  >  0  ,  and 

(3)  for  every  fixed  i  ,  =  1  >  8111(1 

(k)  for  every  fixed  J  ,  =  1  •  (8e«  ) 

Matrices  are  vectors,  indexed  differently.  The  collection  of  n 
by  n  doubly  stochastic  matrices  is  a  polyhedron.  The  Birkhoff  thm 
says  that  the  vertices  of  this  polyhedron  are  the  n  by  n  permutation 
matrices.  A  permutation  matrix  is  a  matrix  such  that  there  is  a  1  in 
each  row,  a  1  in  each  column,  and  all  other  entries  are  zeroes. 

Thm  2,  essentially,  is  due  to  Egervary  ( 1931 ) -  The  algorithm  here 
for  the  assignment  problem,  essentially,  is  due  to  Kuhn  and  Munkres  (1955-57). 
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Where  the  1  In  (3)  i*  replaced  by  any  prescribed  integers  a^  >  0 
and  the  1  in  (b)  le  replaced  by  any  prescribed  integers  b,  >  0  , 

V  " 

we  get  the  linear  constraints  of  the  integer  transportation  problem, 
relations  (l)  and  (2)  of  section  I  .  These  constraints,  together 
with  x^  >  0  for  every  i  and  J  ,  define  a  polyhedron,  say  P^,  . 

Theorem  3  readily  generalizes  to  the  fact  that  all  the  vectlces 
of  Pj  are  integer-valued  rectors.  Thus  we  have  the  very  well- 
known  fact  that  the  integer  transportation  problem  is  an  Instance 
of  linear  programming. 

Indeed,  historically  the  first,  and  still  the  moat  prominent, 
algorithms  for  the  transportation  problem  sure  direct  applications 
of  the  simplex  method. 

Theorem  3>  and  consequently  Theorem  2,  and  consequently  even 
Theorem  1,  are  readily  (and  often)  proved  using  general  / .p .  techniques 
together  with  the  special  properties  of  the  incidence  matrix  [a^j] 
of  a  bipartite  graph. 

Similarly,  extremal  (integer)  flows  in  s  network  can  be  treated 
by  applying  general  j.p.  techniques  to  matrices  [•y]  which  are 
the  "incidence  matrices  of  directed  graphe".  And,  on  the  other  hand, 
the  combinatorial  algorithm  that  we  described  for  the  assignment 
problem  is  very  closely  related  to  the  combinatorial  methods  of  Ford 
and  Fulkerson  for  network  flow  problems. 

Two  advantages  of  these  combinatorial  methods  are  (l)  for 
practical  purposes,  they  are  more  efficient  than  simplex  methods,  and 
(2)  for  theoretical  purposes,  they  provide  theoretical  bounds  on 
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efficiency  of  computation  that  are  so  for  not  available  for  single* 
methods.  The  disadvantage  of  these  combinatorial  methods  is  that 
they  have  not  been  satisfactorily  extended  to  general  i. p. 
problems . 
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IV.  Matching  in  a  general  graph 


So  far  these  lectures  have  been  snail  variations  on  old  stuff. 

How  I  would  like  to  sho '  one  of  the  vmya  In  which  these  combinatorial 
methods  can  be  generalised  to  certain  problems  which  can  not  be 
treated  directly  as  linear  programs  --  at  least  not  In  the  usual 
sense.  Actually  they  are  linear  programs  determined  illicitly 
by  astronomical  collections  of  linear  constraints.  Surprisingly, 
these  problems  are  just  as  tractable  as  the  assignment  problem  -- 
in  spite  of  the  traditional  views  as  to  why  the  assignment  problem 
Is  so  tractable. 

We  treat  the  problem  of  finding  a  maximum-weight  perfect  matching 
In  an  edge-weighted  graph  Q  which  is  not  necessarily  bipartite.  In 
particular  we  can  take  G  to  be  a  complete  graph  --  a  graph  in 
which  every  pair  of  nodes  is  joined  by  an  edge. 

Recall  that  we  have  already  defined  a  polyhedron  PQ  ,  for  any 
graph  G  ,  by  the  linear  contraints  (l)  and  (2)  in  section  III.  We 
have  already  observed  that  the  only  integer-valued  vectors  contained 
In  P_  ere  the  vectors  of  perfect  matchings  in  G  .  That  is,  an 
integer-valued  vector  [a  ]  is  contained  in  PQ  if  and  only  if  its 
components  are  O’ a  and  l’s  ,  and  the  1 -components  correspond  to 
the  edges  of  a  perfect  matching. 

We  have  already  observed  that  where  c  is  the  weight  on  e 

£ 

in  0  ,  the  weigh*. -sum  of  any  perfect  matching  M  is  the  value  of 
£  c^x^ejE)  for  the  vector  of  H  . 
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Hence,  the  maximum  perfect -matching  problem,  for  any  edge- 
weighted  graph  G  is  an  instance  of  "integer  linear  programming". 
Integer  linear  programming  Is  the  problem  of  maximizing  a  given 
linear  function  by  an  integer-valued  vector  subject  to  some 
given  linear  conttraints. 

If  all  the  vertices  of  polyhedron  P^  were  vectors  of  matchings, 
as  ir.  the  case  where  G  Is  bipartite ,  then  the  naximum  matching 
problem  would  be  simply  an  instance  of  linear  programming.  However, 

has  vertices  which  are  fractional- 

The  picture  shows  one  of  the 
sin^lest  graphs  G  such  that  P_ 

u 

contains  a  fractional  vertex  as 

well  as  several  vertices  which 

are  vectors  of  perfect  matchings. 

The  numbers  on  the  edges  are  the 

components  of  the  fractional  vertex 

in  this  Pq  .  Notice  that  where  the  edge-weights  are  the  numbers 

on  the  edges  in  the  picture  on  page  (O  ,  the  maximum -weight  of  a 

perfect  matching  is  l8.  At  the  fractional  vertex  of  P^,  ,  £  c^x^ 

e«E 

has  the  value  19,  which  by  no  accident  is  also  the  min  sure 
of  a  feasible  node -weighting. 


when  G  is  not  biDartite,  P_ 

Cj 

valued . 


A  main  idea  in  tackling  the  general  perfect-matching  problem 
is  to  chop  off  the  fractional  vertices  of  so  as  to  obtain  a 
polyhedron  such  that  all  the  vectors  of  perfect  matchings  are 
still  contained  in  P '  and  such  that  P*  doesn't  have  any 
fractional  vertices,  i.e.,  all  its  vertices  are  vectors  of 
perfect  matchings.  In  other  words,  obtain  the  convex  hull, -  P^  , 
of  the  vectors  of  perfect,  matchings. 

Theorem  K  .  For  any  graph  G  ,  the  convex  hull  of  the  vectors  of 
perfect  matchings  in  G  is  the  polyhedron  P^  given  by  the 
following  linear  constraints: 

(1)  for  every  edge  efE:  xfi  >  0  ; 

(2)  for  every  node  VfV:  £  xg  =  1  ,  where  the  sum  is  taken  over 

all  edges  e  which  meet  node  v  . 

(3)  for  every  subset  s  of  nodes  which  has  cardinality  je|  -  2qfl  +  1 
for  some  positive  integer  q  t  £  x  c  a  ,  where  the  sum  is  taken 
over  all  edges  e  which  have  both  ends  in  a  . 

Any  vector  of  a  matching,  say  M  ,  in  G  satisfies  (3)  for 

any  set  s  ,  since  no  more  than  q  edges  of  M  can  have  both 

ends  in  s  ,  and  since  such  edges  are  the  only  ones  In  M  which 

appear  in  (3).  Therefore,  every  vector  of  a  perfect  matching  of 

G  is  in  the  polyhedron  PI  . 

u 

It  is  not  so  obvious  that  every  vertex  of  P '  is  a  vector  of 

u 

a  perfect  matching.  We  prove  this  by  means  of  an  algorithm  --  in 
a  manner  analogous  to  our  proof  of  Thm  3  by  means  of  the  assignment 
algorithm . 
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The  idea  of  treating  a  combinatorial  problem  by  chopping 
away  at  a  polyhedron  to  eliminate  undesirable  vertices  is  by  no 
meant  new.  A  long  time  ago  Kuhn  and  Dar.tzig  ar.d  others  took  this 
approach  to  the  traveling  salesman  problem.  I  believe  Motgkin 
tried  it  for  the  "^-dimensional  assignment  problem”.  Gcssery 
discovered  finite  algorithms  for  integer  linear  programming  which 
operate  by  chapping  locally  until  the  answer  is  a  vert-ex  of  the 
resulting  polyhedron.  His  methods  could  be  adapted  to  give  finite 
algorithms  for  describing  the  convex  hull  of  the  integer  vectors  in 
any  given  bounded  polyhedron. 

For  the  matching  problem,  and  certain  cousins,  the  chopping- 
idea  has  been  completely  successful  in  the  following  senses,  (l) 

We  get.  a  succinct  and  useful  description  of  precisely  the  relevant 
polyhedra.  (2)  We  get  an  algorithm  that  is  really  good. 

Later,  I'll  describe  a  representative,  ,T optimum  branching" ,  of 
one  other  essentially  different  class  of  problems  for  which  polyhedron¬ 
chopping  has  been  completely  successful. 

Let  U  =  £  cgxe(efE)  be  any  linear  function  of  vectors  [xg]  , 
determined  by  an  arbitrary  specification  of  edge-weights  for  G  . 

To  maximize  U  by  a  vertex  of  is  a  linear  program.  Our 

purpose  is  to  solve  this  £.p.  by  the  vector  of  a  perfect  matching. 

Since  U  is  arbitrary  and  since  for  every  vertex  of  P'  there  is 
a  U  which  is  maximized  only  by  that  vertex,  this  will  prove  Thm  U  . 
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Vte  now  describe  the  dual  of  our  £.p.  Like  £. p.  duals  everywhere, 
it  has  a  variable  for  each  constraint  of  the  primal,  other  than 
non -negativity,  and  it  has  a  constraint  for  every  variable  of  the 
primal,  as  well  as  the  non-negative  constraint  on  each  dual 
variable  that  corresponds  to  an  inequality -constraint  of  the  primal. 

In  particular,  it  has  a  variable  yy  for  each  node  v*V  ,  the 
"node -weight"  for  v  .  And  it  has  variable  yg  ,  an  "odd-set-weight", 
for  each  subset  s  c  V  such  that  | s |  =  2qg  +  1  where  qg  is  a 
positive  Integer.  Pecall  that  |sj  means  the  number  of  nodes  in  s  . 

The  variables  y^  are  allowed  to  go  negative  since  they 
correspond  to  equations  of  the  primal.  The  constraints  of  the  dual  are 

(U)  for  every  set  s  ,  y  >  0  ; 

S 

(5)  for  every  edge  e  ,  ffi(y)  =  y„  +  yw  +  T  yg  >  ce  > 

where  u  and  w  are  the  ends  of  e  ,  and  where  the  summation  Is 
over  all  sets  s  which  contain  both  ends  of  e  . 

Thus,  a  dual  weighting  y  =  [yy>yB]  called  feasible  if  (U) 
and  (5)  hold. 

The  linear  functJ  on  to  be  minimized  is 

(6)  V  -  £  yv  *  E  Va  • 

The  duality  theorem  tells  us  that  a  vector  x°  *  [x°  ]  maximites 
U  *  £  cftxe  subject  to  (l),  (2),  and  (3)  if  and  only  if  there  is 
some  vector  y°  =  [y°>y?]  satisfying  (U)  and  (5)>  for  which  U(x°)  *  W(y°). 
We  shall  find  such  an  x°  which  is  the  vector  of  a  perfect  matching. 
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More  directly  useful  to  cur  purpose  is  the  so-called  "coople- 

« 

raentary  slackness  theorem  on  dual  jj.p.'s.  For  our  particular  dual 
£.p.'s  it  says  that  a  vector  x°  =  [x°]  maximizes  U  subject  to 
(l),  (2),  and  (3),  and  a  vector  y°  =  [y°>y°]  minimizes  W  subject 
to  (4)  and  (5),  if  and  only  if 

(7)  for  every  variable  y  ,  either  y°  =  0  or  else  equality  holds 

in  the  corresponding  constraint  (3); 

(8)  for  every  variable  x  ,  either  x°  =  0  or  else  equality 

e  e 

holds  in  the  corresponding  constraint  (5)- 

Where  x°  =  [x°]  is  the  vector  of  a  perfect  matching  M  of  G  , 

(7)  says  that  for  every  set  s  of  nodes,  either  y°  =  0  or 

s 

else  q  edges  of  M  have  both  ends  in  s  ; 

8 

(8)  says  that  for  every  edge  e  in  M  ,  equality  holdB  in 
the  corresponding  constraint  (5)- 

The  matching  algorithm  finds  a  perf ect  matching  M  and  a 
vector  y°  =  [y°  ,y°]  such  that  (U) , (5) , (7) ,  and  (8)  hold.  This 
guarantees  that  M  is  optimum  and  it  also  proves  Thm  4. 

Vectors  [yv>yfl]  have  an  awful  lot  of  components.  Fortunately, 
however,  the  ones  we  deal  with  have  no  more  non-zero  components 
than  the  number  of  edges  in  G  . 


V .  A  Matching  Algorithm 


Given  any  edge -weighted  graph  G  .  The  algorithm  start a  out 
Just  like  for  the  bipartite  case.  Me  choose  a  feasible  node- 
veighting  [yT]  ,  i.e.,  values  of  yy  such  that  (5)  holds  with 
all  y  *  0  .  Let  G'  be  the  equality  subgraph  of  G  ,  relative 
to  this  [yv,yg]  .  That  is,  O'  consists  of  all  nodes  of  G 
and  thoee  edges  e  for  which  equality  holds  in  the  corresponding 
constraint  (5),  f  =  cg  . 

If  we  can  find  a  perfect  matching  in  G'  then  it  will  be 
optimum  because  then  it  and  the  dual  weights  will  satisfy  (U),  (5), 
(7),  and  (8).  Generally  we  will  not  be  able  to  find  a  perfect 
matching  In  this  G'  or  in  any  other  Buch  O'  determined  by  a 
node -weighting.  However,  lets  try  to,  Just  as  in  the  bipartite 
case . 


Chooae  any  matching  M  ,  not  necessarily  perfect,  in  G'  .  If 
there  is  a  node  r  which  M  doesn't  meet,  start  growing  in  G' 
a  tree  T  rooted  at  r  ,  just  as  we  do  for  the  bipartite  case. 

Recall  that  whsn  G  is  bipartite,  either  (a),  (b),  or  (c) 
must  hold  for  T  in  O'.  (See  Section  II). 

Bipartite  or  not,  vhen  we  spot  an  occurrence  of  (c),  we  enlarge 
T  in  O'  .  Bipartite  or  not,  when  ve  spot  an  occurrence  of  (b), 
we  get  a  better  matching  in  C  ,  we  discard  the  current  T  in  O'  , 
and  if  the  matching  is  still  not  perfect  we  start  another  tree  T  . 
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In  case  (a),  the  outer  nodes  of  T  comprise  a  hungarian 

set  H  relative  to  and  the  inner  nodes  of  T  comprise  the 

set  N(H)  of  neighbors  of  H  relative  to  G'  .  Bipartite  or 

not,  when  (a)  occurs  we  change  the  dual  weighting  [y  ,y  ]  so 

that  other  edges  of  G  enter  G'  .  We  then  continue  to  treat 

the  same  T  relative  to  the  new  G'  .  In  general,  the  way  the 

dual-weighting  changes  is  more  complicated  than  for  a  bipartite  G 

where  we  don't  have  any  positive  weights  y  .  Indeed,  so  far  our 

s 

weights  yg  are  all  zero.  We  shall  have  to  describe  low  some  of 
them  become  positive.  We'll  do  so  after  we  describe  the  concept 
of  pseudo-node . 

When  G  is  not  bipartite,  a  fourth  case  (d)  can  occur: 

(d)  Two  outer  nodes,  say  v^  and  v^  ,  of  T  are  Joined 
by  an  edge,  say  e^  ,  of  G'. 

When  G  is  bipartite,  any  two  outer  nodes  of  T  mu.it  be  in 
the  same  part,  or  V?  ,  of  G  ,  and  so  (n)  can  not  occur. 

One  can  immediately  verify  that  for  any  G*  (regardless  of 
whether  G  ..a  bipartite),  either  (a),  (fc),  (c),  or  (d)  holds  for 


When  (d)  occurrs,  matters  get  especially  tricky. 


Let  be  the  path  in  T  from  back  to  r  .  Let  P^ 
be  the  path  in  T  from  Vg  back  to  r  .  Paths  P^  and  Pg  to¬ 
gether  with  the  edge  e^  Joining  v^  and  Vg  form  what  we  call 
a  flower. 


blossom  B  :  the  polygon.  Of course  if  P^  and  Pg  happen  not 
to  run  together  until  they  get  back  to  r  ,  then  t  =  r  and 
hence  the  stem  is  Just  the  node  r  .  It  is  also  possible  for 
t  *  v^  ,  or  even  t  *  r  *  v^  .  The  number  of  edges  and  the 
number  of  nodes  in  blossom  B  is  odd  and  greater  than  1,  i.e., 
2q  +  1  where  q  is  some  positive  integer. 

We  now  obtain  a  new  graph  G  ,  a  new  subgraph  G'  ,  and  a 
new  tree  T  ,  from  the  ones  we've  got,  by  shrinking  to  a  single 
pseudo -node ,  v^  ,  the  blossom  B  and  all  edges  that  have  both 
ends  in  B  . 
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The  edges  of  the  matching  M  that  are  not  shrunken  away  do 


form  a  new  matching  M  in  the  new  G'  .  The  new  T  ,  formed  by 

the  edges  of  the  old  T  which  are  not  shrunken  away,  i3  a  tree 

in  the  new  G '  having  the  correct  structure  relative  to  the  new  M 
The  pseudo-node  v^  is  an  outer-node  of  the  new  T  .  If  t  =  r  , 
then  is  the  root  of  the  new  T  . 

Whenever  we  shrink  a  blossom  we  remember  it  so  that  later  we 
can  expand  the  ^seudo-node  to  recover  it.  We  never  bother  tc 
remember  the  edges  of  the  current  matching  in  a  blossom  that  we 
shrink,  because  they  are  not  likely  to  be  compatible  with  the 
matching  that  is  current  when  we  expand  the  pseudo-node  back  to 
the  blossom. 

The  algorithm  continues  as  before,  considering  occurrances 
of  (a),  (b),  (c),  or  (d),  relative  to  the  new  G  ,  G’,  M,  and  T  . 

Each  time  we  spot  an  occurrence  of  (d),  we  shrink  the  blossom, 
thereby  obtaining  still  another  G,  G',  M,  and  T  .  A  blossom 
containing  pseudo-nodes  might  be  shrunken  into  another  pseudo-node, 
so  we  can  have  pseudo-nodes  "inside"  the  pseudo-nodes  of  G'  . 

The  set,  say  s  ,  of  all  real  nodes  inside  a  pseudo-node  always  has 
odd  cardinality,  because  the  sum  of  an  odd  number  of  odd  numbers 
is  odd.  We  remember  every  blossom  we  shrink  so  that  any  pseudo¬ 
node  can  be  expanded  anytime  that  it  is  not  inside  another  pseudo¬ 
node  . 
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Whenever  we  spot  an  occurrence  of  (c),  we  enlarge  the  T  in 
the  current  G'  . 

Whenever  we  spot  an  occurrence  of  (b),  we  get  a  new  mat Chios 
in  the  current  G'  such  that  fewer  nodes  in  G'  are  left  unmet 
by  the  new  matching.  When  this  happens  we  discard  T  .  If  there 
is  another  node  r  in  O'  still  not  met  by  the  matching,  we  start 
growing  in  the  same  G'  another  tree  rooted  at  that  r  .  Pseudo¬ 
nodes  of  G'  formed  from  blossoms  of  earlier  trees  may  become 
inner  nodes  or  outer  nodes  of  this  tree . 

Whenever  there  are  no  cases  of  (b),  (c),  or  (d)  to  spot,  we 
have  case  (a),  i.e.,  the  outer  nodes  of  T  are  a  hungerian  set  H 
relative  to  G'  and  the  inner  nodes  of  T  are  the  neighbor  set  N(H) 
of  H  relative  to  G'.  We  must  in  this  case  consider  changing 
the  dual  weighting  y  *  [yy,yB]  • 

In  general,  for  the  current  feasible  dual -weighting,  y  =  [y  y  ]  , 
a  ys  is  positive  only  if  s  is  the  set  of  real  nodes  inside  some 
pseudo-node,  either  a  pseudo-node  of  the  current  G'  or  a  pseudo- 
node  at  any  level  Inside  a  pseudo-node  of  the  current  G’  For 
every  edge  e  which  is  either  an  edge  of  G'  or  an  edge  of  a 

currently  shrunken  blossom,  we  have  equality,  f^(y)  -  ,  for  the 

constraint  (5)  corresponding  to  e  ;  conversely,  if  f  (y)  »  c  holds 
for  an  edge  e  of  the  current  G  ,  i.e.,  for  an  edge  e  that  Is 

not  inside  s  current  pseudo-node,  then  e  is  an  edge  of  G’  .  These 

ere  the  senses  in  which  a  pseudo  O'  is  the  equality  subgraph  of 
a  pseudo  G  . 
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Assuming  these  conditions  hold,  we  now  describe  how,  when  (a) 
holds,  to  get  a  new  feasible  dual -weighting  such  that  theae 
conditions  continue  to  hold  and  such  that  either  we  have  a  new  O’ 
relative  to  which  (b),  (c),  or  (d)  holds  for  T  ,  or  else  we 
dispose  of  a  pseudo  inner  node  of  T  ,  or  else  ve  have  V  -»  -  »  , 
in  which  case  there  is  no  perfect  matching  in  G  .  We  chooae  c 
to  be  as  large  as  possible  subject  to  the  following  constraints 
with  right-hand  sides  given  by  the  current  dual -weighting . 

(9)  For  every  edge  e  of  G  ,  not  in  G'  ,  such  that  one  end  of  e 
is  an  outer  node  of  T  and  the  ether  end  of  e  is  not  in  T  , 

<  <  fe(y)  -  ce  ■ 

(10)  For  every  edge  e  of  G  ,  not  in  G'  ,  such  that  both  ends 

of  e  are  outer  nodes  of  T  ,  ?t  <  f  (y)  -  c 

—  e  e 

(11)  For  every  e  which  1b  the  set  of  ell  real  nodes  inside  a 
pseudo  inner  node  of  T  ,  2(  <  y 

(The  pseudo  inner  no_.es  that  we  refer  to  here  are  nodes  of  the 
current  G  ,  not  pseudo  nodes  Inside  of  pseudo  nodes.) 

We  a«6ujae  for  the  manent  that  at  least  one  such  constraint 
exists,  so  that  c  has  a  maximuo. 

Rev  we  change  the  dual  weighting  rv  ,y  '  ss  follows.  For 
ev-ry  real  node  v  which  is  either  an  outer  n.de  of  T  or  else 
inside  a  paeudo  outer  node  of  T  ,  lover  y  by  (  .  For  every  s 

V 


which  la  the  set  of  all  real  nodes  inside  a  pseudo  outer  node  of  T  , 
raise  yg  by  2<  .  Por  every  real  node  v  which  is  either  an 

inner  node  of  T  or  else  ins'jc  an  inner  node  of  T  ,  raise 

yv  by  <  .  For  every  s  which  is  the  set  of  all  real  nodes 
inside  a  pseudo  inner  vertex  of  T  ,  lower  yg  by  2<  . 

Suppose  that  the  size  of  f  was  determined  by  equality  in 
scroe  instance  of  either  (9)  or  (10) .  Let  e  denote  the  corresponding 
edge.  After  the  { -adjustment  of  the  dual -weighting,  the  new  G' 
is  a  certain  different  subgraph  of  the  same  G  (perhaps  having 
pseudo-nodes).  Subgraph  G'  is  determined  by  G  and  the  new 
dual -weighting.  The  same  T  and  M  are  in  this  new  G'  .  The 
edge  e  enters  G '  .  If  only  one  end  of  e  is  an  cuter  node  of  T 
(and  the  other  end  not  in  T),  then  we  have  immediately  an  occurrence 

of  either  (b)  or  (c).  If  both  ends  of  e  are  outer  nodes  of  T  , 

then  we  immediately  have  an  occurrence  of  (d),  a  blossom  to  shrink 
as  previously  described. 

Suppose  the  site  of  {  was  determined  by  equality  In  sane 
instance  of  (]l)  .  Let  v  denote  the  corresponding  pseudo  inner 
nod*:.  Let  denote  the  edge  of  the  current  watching  which  meets  v 

Edge  e  is  In  T  .  Let  e^  denote  the  other  edge  of  T  which 
meets  v  .  After  the  {-adjustment  of  the  dual -weighting,  ve  expand 
to  tte  blosscm.  say  B  ,  whose  shrinking  introduced  v  .  This  expans  1 
give*  rise  to  a  new  G  and  G'  .  It  is  easy  to  verify  that  B  la 
pari  of  the  new  G'  .  That  la.  ^(y)  *  holds  for  the  edges  e 
of  B  .  Let  Yj  denote  the  node  of  B  which  edge  meets  •, 


let  v?  denote  the  node  of  B  which  meets.  The  only  node 
of  B  which  is  met  by  the  current  matching  is  v^  j  we  are  able 
to  add  to  the  matching  certain  edges  of  B  so  as  to  get  a  new 
matching  M,in  the  new  G'  ,  which  meets  all  the  nodes  of  B  . 

The  edges  of  the  current  T  remain  in  the  new  G'  .  Unless 

=  Vg  ,  they  do  not  form  a  tree  in  the  new  G*  .  However,  they, 

Uvcsrtftin  One  oflttsiutt  p*>tks  in  3  joinlno  UfcTe 
together  vitlM  minus  the  tifiw  mat  which  wss»i» 


,  do  form  our  new  tree  T  .  This  new  T  does  have  the  correct 
structure  relative  to  the  new  G'  and  new  M  .  Relative  to  this 
new  T  ,  M  ,  G'  ,  G  ,  and  [y  ,y  ]  >  ve  now  return  to  looking  for 
further  occurrences  of  (a),  (b),  (c),  or  (d) .  Incidentally,  in  ^ 
the  situation  we  'ust  treated,  i.e.,  where  the  size  of  e  was 
determined  by  equality  in  an  instance  of  (ll) ,  it  might  be  that 
<  =  0  .  This  isn’t  relevant  to  the  treatment. 


We  have  finished  describing  all  situations  of  the  algorithm, 


except  the  two  terminal  situations. 


One  of  the  terminal  situations  is  when  we  obtain  in  some  G'  , 
perhaps  with  pseudo  nodes,  a  matching  M  which  is  perfect.  We 
then  expand  pseudo  nodes  v  ,  one  after  another,  to  the  polygons  B 
that  they  represent.  Immediately  before  a  pseudo  node  v  is 
expanded,  the  matching  M  ,  that  we  have,  is  perfect  in  the  graph  G' 
with  with  node  v  ,  that  we  have.  let  e  denote  the  unique  edge 
of  M  which,  in  that  G'  ,  meets  node  v.  Expanding  v  to  polygon  B 
gives  us  a  larger  G'  containing  B  instead  of  v  .  In  this  G’  , 
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e  is  the  only  edge  of  M  that  meets  a  node  of  B  .  By  adjoining 
to  M  certain  edges  of  B  ,  we  obtain  a  perfect  matching  M  for 
this  larger  G'  .  Unless  there  are  no  more  pseudo-nodes,  we  then 
treat  some  pseudo-node  of  this  G'  ,  perhaps  one  in  B  ,  in  the 
same  wey. 

Eventually,  we  get  a  perfect  matching  M  in  the  original 
graph  G  .  It  will  be  an  optimum  perfect  matching,  because  it  and 
the  dual -weighting,  that  ve  have,  together  satisfy  conditions  (4), 
(5),  (7),  and  (8). 

The  other  terminal  situation  is  an  occurrence  of  (a)  for 
which  there  are  no  constraints  (9),  (10),  or  (ll).  In  this  case, 

C  can  be  chosen  as  large  as  we  please.  By  using  e  -»  <*>  to  change 
the  dual  weighting  [yy,y  ]  ,  as  already  described,  we  get  a 
feasible  [yv,ye]  8UcJl  +'bat  W  •  .  Hence,  there  can  be  no 
perfect  matching  in  G  . 

This  completes  the  description  of  the  algorithm.  Just  as  when 
G  is  bipartite,  we  can  observe  a  bound,  relative  to  the  size  of  G  , 
on  the  number  of  operations  in  applying  the  algorithm  to  G  ,  which 
dhows  the  algorithm  to  be  not  only  finite,  but  very  good.  At  the 
same  time,  Theorem  4  is  proved. 
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VI.  Theorems  of  Tutte,  Peterson,  and  Konig 


The  algorithm  also  provides  proof  of  the  following  theorem 
of  W.T.  Tutte  (19^7),  analogous  to  Theorem  1.  We  define  a 
Tutte  family  in  a  graph  G  to  he  a  family  of  disjoint  connected 
subgraphs  Gi  of  0  such  that  each  G^,  contains  an  odd  number 
of  nodes  (perhaps  one  node)  and  such  that,  upon  shrinking  every 
G^  to  a  node  v  ,  the  set  of  nodes  is  a  hungarian  set  of 

the  resulting  graph.  (Tutte  does  not  describe  the  family  in  this 
way.) 


Theorem  5-  A  graph  contains  a  perfect  matching  if  and  only  if  it 
does  not  contain  a  Tutte  family. 

The  "only  if"  part  is  fairly  easy.  If  G  contains  a  Tutte 
family  of  subgraphs  and  also  a  perfect  matching  M  ,  then, 
because  G.^  has  an  odd  number  of  nodes,  at  least  one  edge  say  e^  , 
of  M  has  one  end  in  G^  and  the  other  end  not  in  G^^  .  After 
shrinking,  the  edges  e^  meet  the  nodes  ,  and  have  distinct 
nodes  at  their  other  ends.  This  is  impossible,  however,  since 
the  set  of  nodes  v^  is  hungarian. 

To  prove  the  "if'1  part  recall  the  terminal  situation  of  the 
algorithm  where,  for  an  occurrence  of  (a),  there  are  no  constraints 
(9),  (10),  and  (ll).  The  absence  of  a  constraint  (ll),  means  that 
no  inner  node  of  T  is  pseudo.  Since  the  outer  nodes  of  T  are 
a  hungarian  set  H  of  G1  ,  the  absence  of  constraints  (9)  and  (10) 

means  that  H  is  also  a  hungarian  set  of  G  .  Its  neighbor  set 
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N(H)  consists  of  the  inner  nodes  of  T  ,  ell  real.  The  nodes 
in  H  ,  call  them  nodes  ,  are  obtained  by  shrinking  disjoint 

connected  subgraphs  of  the  original,  real-noded,  G  .  Each  G^ 

contains  an  odd  number  of  nodes.  Therefore,  the  original,  real- 
noded,  G  contains  a  Tutte  family. 

The  only  alternative  is  the  other  terminal  situation  of  the 
algorithm,  and  it  yields  a  perfect  matching.  Thus,  Tutte 's  theorem 
is  proved. 

Ofcourse,  a  much  more  pleasant  proof  can  be  obtained  by 
stripping -down  the  algorithm  to  one  for  simply  finding  in  any  G 
either  a  perfect  matching  or  a  Tutte  family.  Indeed,  finding  an 
algorithm  for  the  latter  was  a  main  hurdle  in  finding  the  algorithm 
that  maximizes  weight-sum.  Tutte’ s  original  proof  of  Theorem  5  ia 
fascinatingly  unalgorithmic,  and  it  prompted  a  number  of 
programmatic  efforts  on  the  subject. 

The  subject  of  matchings  started  over  75  years  ago  with  the 
U -color  map  conjecture.  The  conjecture,  still  unproved,  says  that 
for  any  way  of  dividing  up  the  plane  into  a  "map",  by  a  connected 
graph  (a  "planar"  one)  embedded  in  the  plane  so  that  every  edge 
lies  on  the  boundary  of  two  different  regions,  the  regions  can  be 
colored  with  only  four  colors  so  that  any  two  regions  having  an 
edge  in  common  are  colored  differently.  The  property  of  every 
edge  lying  on  the  boundary  of  two  different  regions  of  the  map  ia 
equivalent  to  the  planar  graph  containing  no  isthmus.  An  isthmus 
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of  a  connected  graph,  planar  or  not,  is  an  edge  whose  deletion 
leaves  the  graph  unconnected. 

By  "perturbing"at  each  node  the  conjecture  easily  reduces  to 
the  case  where  the  graph  has  degree  3  at  each  node.  The  degree 
of  a  graph  G  at  a  node  is  the  number  of  edge -ends  in  G  that 
meet  the  node . 

An  interesting  theorem,  which  we  won't  prove,  is  that: 

A  planar  map  having  degree  3  at  every  node  can  be  colored  (properly) 
with  b  colors  if  and  only  if  the  graph  of  the  map  contains  three 
mutually  disjoint  perfect  matchings.  Thus,  the  lt-color  conjecture 
is  equivalent  to  the  statement  that  any  3-degree,  connected,  planar 
graph  with  no  isthmus  contains  3  mutually  disjoint  perfect  matchings. 

In  1891,  Peterson  made  the  following  contribution  toward  proving 
the  l* -col or  conjecture. 

Theorem  6.  A  3-degree,  connected  graph  G  with  no  isthmus,  whether 
planar  or  not,  contains  a  perfect  matching. 

Let's  prove  this  using  Tutte's  theorem.  Suppose  a  graph  G 
as  described  in  Theorem  6  contains  no  perfect  matching.  Then  it 
contains  a  Tutte  family  of  subgraphs  G^  .  Since  G^  contains  an 
odd  number  of  nodes,  since  each  node  has  odd  degree,  and  since  the 
collection  of  edges  meeting  nodes  in  G^  has  collectively  an  even 
number  of  edge -ends,  the  number  of  edges  of  G  having  one  end  In  0^  , 
and  one  end  not  in  G^^  ,  is  odd. 
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Not  every  can  have  as  many  as  3  such  edges,  since  the 
shrunken  G^'s  are  a  hungarian  set  whose  smaller  neighbor  set 
must  meet  all  such  edges,  and  no  node  in  the  neighbor  set  can  meet 
more  than  3  such  edges.  Therefore,  there  is  at  least  one  subgraph 
such  that  exactly  one  edge,  say  e  ,  has  one  end  in  G^  and 
one  end  not  in  G^  .  Edge  e  is  then  an  isthmus  of  G  ,  contradicting 
the  hypothesis.  So  Theorem  6  is  proved. 

By  deleting  the  edges  of  a  perfect  matching  M  from  the  G 
o .  Theorem  6,  we  are  left  with  simply  a  collection  of  mutually 
disjoint  polygons.  Clearly,  the  set  af  edges  in  this  collection 
of  polygons  can  be  partitioned  into  two  perfect  matchings  if  and 
only  if  each  of  the  polygons  contains  an  even  number  of  edges. 
Considerable  effort  has  been  spent  on  trying  to  prove  that  when  G 
la  planar,  there  exists  an  M  such  that  G-M  consists  of  even 
polygons . 

If  the  G  of  Theorem  6  is  bipartite,  then,  for  any  M  ,  G-M 
consists  of  even  polygons,  because  it  is  easy  to  show  that  every 
polygon  in  a  bipartite  graph  is  even.  Thus,  the  U-color  conjecture 
is  proved  for  any  planar  map  whose  graph  G  is  3-d*grec  and 
bipartite . 

Indeed,  though  little  is  known  about  partltiioning  the  edges 
of  a  general  k -degree  graph  into  k  perfect  matchings,  we  do  have 
the  following  theorem  about  k -degree  bipartite  graphs.  The 
elementary  theory  of  bipartite  graphs,  including  this  theorem  and 
Theorem  1,  la  due  to  Xonlg  (circa  1925) • 


-  38  - 


Theorem  7.  The  edges  of  any  k-degree  bipartite  graph  can  be 
partitioned  into  k  perfect  matchings. 

For  any  perfect  matching  M  in  a  k-degree  bipartite  graph  0  , 
clearly  G-M  is  a  (k-l)  degree  bipartite  graph.  Hence,  it  suffices 
to  show  that  any  k-degree  bipartite  G  contains  an  M  .  If  G 
doesn't  contain  an  M  ,  then  ,  by  Theorem  1,  it  contains  a  hungarian 
set  H  .  Together  the  |H|  nodes  in  H  meet  k*  |H|  different 
edges.  At  the  other  ends  of  all  these  edges  is  N(H).  But  this 
is  impossible,  since  the  degree  of  each  node  in  H(H)  is  only  k  , 
and  N(H)  <  |H|  .  So  Theorem  7  is  proved. 

The  mystery  of  the  4 -color  conjecture  Beems  not  due  to  mystery 
about  planar  graphs  and  maps.  The  subject  of  "planarity"  is  very 
well  understood.  The  mystery  is  due  to  the  lack  of  a  satisfactory 
theory  about  the  combinatorics  of  coloring,  i.e.,  partitioning. 

If  you  could  find  a  good  algorithm  for  deciding,  for  any  given 

3- degree  graph  G  ,  whether  the  edges  of  G  can  be  partitioned 
into  3  perfect  matchings,  then  you  could  probably  settle  the 

4- color  conjecture. 


VII.  Degree -Constrained  Subgraphs 

Given  any  graph  0  vith  a  real  numerical  weight  cg  for  each 
edge  e<E  and  an  integer  by  for  each  node  VfV  ,  find  in  G  , 
if  there  is  one,  a  subgraph  M  which  has  degrees  by  at  nodes  v 
and  whose  edges  have  me:cimum  weight-sum.  This  is  called  the 
"optimum  b -matching  problem"  or  the  "optimum  degree -constrained 
subgraph  problem"  (for  "undirected  graphs"). 

Where  b  =  [by]  ,  VfV  ,  a  b -matching  M  in  G  is  a  subset 
M  cE  of  edges  such  that  by  edge -ends  of  edges  in  M  meet  node  v 
(Tutte  and  many  other  authors  in  graph  theory  would  say  "b-factor" 
rather  than  "b-matching" .)  Obviously  there  is  a  1-1  correspondence 
between  the  b-matchings  in  a  graph  G  and  the  b-degree  subgraphs 
of  G  that  contain  all  the  nodes  of  G  . 

Wt  allow  0  ,  and  hence  M  ,  to  contain  loops  and  multiple 
parallel  edges.  A  locp  is  an  edge  such  that  both  of  its  ends  meet 
the  same  node.  Several  edges  are  said  to  be  parallel  to  each  other 
if  they  all  meet  the  same  one  or  two  nodes.  Loops  and  multiple 
parallel  edges  are  superfluous  in  the  1 -matching  problem. 

Tutte  (195U)  generalised  his  Theorem  5  of  the  last  section  to 
a  characterisation  of  graphs  G  which,  for  given  b  ■  [by3  >  do  not 
contain  a  b -matching .  Using  rimilar  devices  we  shall  show  how  to 
get  a  good  algorithm  for  optimum  b-matching 
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In  fact,  ve  shall  generalize  further,  thereby  Including  directly 


the  integer  network  flow  of  problem  of  Ford  and  Fulkerson.  The 

latter  may  be  regarded  as  the  following:  Given  any  directed  graph 

(network)  G  with  a  real  numerical  weight  (coBt)  cg  for  each 

edge  (arc)  e<E  and  with  an  integer  bv  for  each  node  v^V  , 

find  in  G  ,  if  there  is  one,  a  subset  M  c  E  of  edges  such  that, 

for  every  v*V  ,  the  number  of  edge -ends  of  M  directed  toward  v 

minus  the  number  of  edge -ends  of  M  directed  away  from  v  equals 

b  ,  and  such  that  £  c  is  maximum  (or  minimum).  A  negative  b 
e^M  e  v 

is  called  a  source,  and  a  positive  by  is  called  a  sink. 

To  get  the  appropriate  common  generalization  of  undirected 
graph  and  directed  graph,  we  introduce  the  concept  of  "bidLirected 
graph".  A  graph  G  is  called  bldlrected  if  each  edge -end  of  G 
has  either  a  +1  or  a  -1  on  it.  Equivalently,  each  edge -end  Is 
directed  either  toward  or  away  from  the  node  it  meetB,  independently 
of  how  the  other  end  of  the  same  edge  is  directed.  Equivalently, 
each  end  of  an  edge  ir,  either  a  front-end  or  a  rear  end,  independently 
of  what  the  other  end  of  the  same  edge  is. 

The  degree  of  a  node  v  in  a  bidirected  graph  G  is  the 
number  of  front -ends  in  G  that  meet  v  minus  the  number  of  rear 
ends  in  G  that  meet  v  .  With  this  new  definition  of  degree, 
the  optimum  degree -constrained  subgraph  problem  is  the  same  as  stated 
above . 

A  bidirected  0  is  directed  if  every  edge  of  0  has  a  front 
end  and  a  rear  end.  A  bidirected  G  is  undirected  if  every  edge 
of  0  has  two  front  ends.  Another  interesting  case  is  where  every 


Ige  has  either  two  front  ends  or  two  rear  ends. 


Another  generalizati m  is  obtained  by  introducing  single  ended 
objects  called  slacks,  positive  slacks  and  negative  slacks,  i.e., 
front  slacks  and  rear  slacks.  A  slack  in  a  graph  meets  only  one 
node.  A  graph  G  with  slacks  is  regarded  as  undirected  when  all 
ends  in  G  including  those  of  slacks  are  front  ends. 

Slacks  conveniently  represent  upper  and  lower  bound  degree - 

constraints.  Suppose  we  wish  to  find  a  maximum  weight  subgraph 

M  of  G  such  that  the  degree  of  M  at  node  v  is  at  least 
1  2 

bv  and  at  most  by  .  In  other  words,  suppose  we  wish  to  find  in 

G  a  maximum  weight  b -matching  where  b  =  [b  ]  and  where  each 

1  2 

b  is  in  the  interval  b  <  b  <  b  .  Obtain  from  G  a  new  graph 

V  v  — ■  V  ~  V 

2  o 

Gq  by  introducing  at  each  node  v  of  G  ,  by  -  by  negative  slacks 

and  b°  -  b^  positive  slacks  where  b°  is  some  Integer  between 
1  2 

by  and  by  .  Give  these  alacka  weight  zero.  Finding  a  maximum 
weight  b°  -  matching  in  Gq  is  equivalent  to  finding  a  maximum 
weight  b-matchlng  In  G  . 

You  may  ask  why  I  don't  Introduce  "edges"  with  1  ends.  I  would 
If  I  knew  a  good  algorithm  for  handling  them. 

Another  valuable  generalization,  suggested  by  the  transportation 
and  integer  flow  problems,  is  to  maximize  U  T  c  x  (ecE)  by  an 

f  f 

t nteger-valued  vector  x  *  f xe ]  that  satisfies 

M)  for  every  element  e(S  .  either  an  edge  e  or  a  slack  e  , 

0  -  x  <  a  ; 

-  e  —  e 
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(2)  for  every  node  v  e  V, 

la  x  =  b  ,  (eeE),  where 

e  ev  e  v'  ' 

a  =  0  if  e  does  not  meet  v, 
ev 

a  =  1  if  e  has  one  front  end  at  v, 
ev 

a  =  -1  if  e  has  one  rear  end  at  v. 
ev 

a  =  2  if  e  is  a  loop  with  two  front  ends  at  v,  and 
ev 

a  =  -2  if  e  is  a  loop  with  two  rear  ends  at  v. 
ev 

Matrix  [a  ]  is  called  the  incidence  matrix  of  bidirected  graph 
G.  Set  V  is  the  set  of  nodes  ir.  G.  Set  E  is  the  set  of  edges,  includ¬ 
ing  loops,  and  also  the  slacks  in  G.  The  integer  i6  the  degree - 
constraint  at  v.  The  integer  is  called  the  capacity  on  e. 

When  every  Cg  is  1,  this  problem  is  simply  the  b-matching 
problem.  relative  to  graph  G.  When  the  Oe's  are  any  positive  integers, 
the  problem  is  the  b-matching  problem  relative  to  the  graph  obtained 
from  G  by  replicating  O g  times  the  element  e.  We  also  permit 
®e  *  **•  course,  if  the  problem  has  a  solution  it  must  be  possible 
to  replace  a  *  •  by  a  large  0^.  However,  as  a  matter  of  fact, 
infinite  capacities  are  much  easier  to  handle  than  finite  capacities. 

We  now  describe  how  any  optimum  b-matching  problem  can  be 
reduced  to  an  optimum  1 -matching  problem. 

We  describe  first  how  any  b-matching  problem  on  a  bidirected 
graph  G  can  be  reduced  to  a  b*-matching  problem  on  an  undirected 
graph  G*.  For  each  no^e  v  in  G,  let  there  be  two  redes,  say  u  and  w, 
in  G*.  Let  ail  the  front  er4s  at  v  be  front,  ends  at  u  in  G*.  Let 
all  the  rear  ends  at  v  be  front  ends  at  v  in  G*.  Let  there  be  in  G* 
a  new  edge  e  with  a  front  end  at  u,  a  front  end  at  w,  and  any 
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appropriately  large  capacity.  Let  the  degree  constraint?  b  and  b 

#  * 

be  appropriately  large  and  such  that  fc  -  =  b^_.  Let  every  edge  or 

slack  in  G  have  the  same  veight  in  G*  as  in  G.  Let  the  new  edges  ir 
G*  have  veight  zero. 

One  can  verify  that  if  G  is  directed  then  G*  is  bipartite. 

We  next  describe  hov  any  b-ssatching  problem  on  an  undirected 
graph  G  can  be  reduced  to  a  b* -matching  problem  on  an  undirected  graph 
G*  such  that  the  edge  capacities  are  all  •  and  such  that  there  are  no 
slacks. 

For  each  edge  e  of  G  having  finite  capacity  Cl ,  and  meeting  say 

nodes  u  and  v  in  G,  replace  e  by  a  path  Pg  of  three  edges  joining  u 

to  w;  give  the  tvo  new  nodes,  interior  to  P  ,  degree  constraints  equal 

to  0^.  Give  one  of  the  non-middle  edges  of  Pg  the  weight  tuat  e  had; 

give  zero  weignts  to  the  other  two  edges  of  P  .  Let  r  denote  a 

special  new  node.  For  each  slack  e  of  G,  having  capacity  a  ,  and 

c 

meeting  say  node  v  in  G,  replace  e  by  a  path  P  ol  two  »*iges  joining 

v  to  r;  give  the  new  node,  interior  to  path  P  ,  a  degree-constraint 

equal  to  a e  or  b^,  whichever  is  smaller.  Give  the  edge  of  P^  that 

meets  v  the  weight  that  slacn  e  had;  give  zero  weight,  to  the  ether 

edge  of  P  .  Let  there  be  a  zero-weighted  loop  with  both  (front)  ends 

at  r.  Give  r  any  appropriately  large  degree-constraint  whose  parity 

is  such  that  the  sum  of  the  degree -const rmint 8  on  all  nodes  is  even. 

The  result  of  this  construction  is  the  desired  b*  -matching  problem. 

an  undirected 

Kext  we  describe  how  to  reduce  any  b -matching  problem  on^  graph 
G,  ajch  that  there  are  no  slacss  and  such  every  edge  e  bar  capacity 
<*e  *  •,  to  a  perfect  matching  problem  (iise  treated  ir.  section  V)  on 


a  graph,  say  G*.  For  every  node  v  lr.  G,  having  degree -constraint  by, 
moke  copies  of  v  to  be  nodes  of  G*.  Join  a  pair  of  nodes  of  G*  by 
an  edge,  say  e*,  if  and  only  if  these  tvo  nodes  are  copieB  of  nodes 
in  G  that  are  Joined  by  an  edge,  say  e,  of  G.  Let  the  weight  of  the 
e*  in  G*  equal  the  weight  of  the  e  in  G.  Let  every  r.  de  in  G*  have 
degree-constraint  equal  1.  The  pre-image  in  G,  -ith  appropriate 
multiplicities,  of  an  optimum  perfect  matching  in  G*,  is  an  answer 
to  the  given  b-natching  problem. 

To  use  these  reductions  directly  as  algorittas  is  computationally 
rather  wasteful.  However,  one  can  use  them  to  derive  a  good  direct 
algorithm  for  b-matching  from  the  algorithm  for  1-matching. 

A  paper  to  appear,  called  "Optimum  degree -constrained  subgraphs," 
by  Kills  Johnson  and  me,  will  describe  a  direct  algorithm,  theorems 
analogous  co  Theorem  4,  and  computational  experience,  for  general 
optimum  fc-matchings. 

The  matching  algorithm  described  m  section  V  wi  devised  in 
1^62.  Since  then  I’ve  tried  to  find  good  algorithms  for  some  other 
obviously  finite  problems.  I  have  found  a  few,  but  it  seems  that 
such  algorithms  are  n jt  easy  to  come  by. 
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A  mitroid  M  is  a  finite  set  W  of  elements  with  a  family  of  subsets,  called  independent,  such  that 
lit  every  subset  of  an  independent  set  is  independent,  and  (2)  for  every  subset  A  of  M,  all  maximal 
independent  subsets  of  A  have  thr  same  cardinality,  called  the  rank  r[A)  of  A.  It  is  proved  that  a 
matroid  can  be  partitioned  into  as  few  as  k  sets,  each  independent,  if  and  only  if  every  subaet  A  has 
cardinality  at  moat  £  ■  HA t. 


1.0.  Introduction 

Matroids  can  be  regarded  as  a  certain  abstraction 
of  matrices  [8].*  They  represent  the  properties  of 
matrices  which  are  invariant  under  elementary  row 
operations  but  which  are  not  invariant  under  elemen¬ 
tary  column  operations  —  namely  properties  of  depend¬ 
ence  among  the  columns.  For  any  matrix  over  any 
field,  there  is  a  matroid  whose  elements  correspond 
to  the  columns  of  the  matrix  and  whose  independent 
sets  of  elements  correspond  to  the  linearly  independent 
sets  of  columns.  A  matroid  M  is  completely  deter¬ 
mined  by  its  elements  and  its  independent  sets  of 
elements. 

The  same  letter  will  be  used  to  denote  a  matroid 
and  its  set  of  elements.  The  letter  /  with  various  sub 
or  superscripts  will  be  used  to  denote  an  independent 
set. 

The  interest  of  matroids  does  not  lie  only  in  how  they 
generalize  some  known  theorems  of  linear  algebra. 
There  are  examples,  which  I  shall  report  elsewhere, 
of  matroids  which  do  not  arise  from  any  matrix  over 
any  field—  so  matroid  theory  docs  truly  generalize  an 
aspect  of  matrices.  However,  matroid  theory  is  jus¬ 
tified  by  new  problems  in  matrix  theory  itself — in  fact 
by  problems  in  the  special  matrix  theory  of  graphs 
(networks).  It  happens  that  an  axiomatic  matroid  set¬ 
ting  is  most  natural  for  viewing  these  problems  e  nd  that 
matrix  machinery  is  clumsy  and  superfluous  for  view¬ 
ing  them.  The  situation  is  somewhat  similar  to  the 
superfluity  of  (real)  matrices  to  the  theory  of  linear  op¬ 
erator*;,  though  there  a  quite  different  aspect  of  mat¬ 
rices  is  superfluous.  When  it  comes  to  implementing 
either  theory,  matrices  are  often  the  way  to  do  it. 

Matroid  theory  so  far  has  been  motivated  mainly 
by  graphs,  a  special  class  of  matrices.  A  graph  G  may 
be  regarded  as  a  matrix  N(G)  of  zeroes  and  ones,  mod  2, 
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which  has  exactly  two  ones  in  each  column.  The 
columns  are  the  edges  of  the  graph  and  the  rows  are 
the  nodes  of  the  graph.  An  edge  and  a  node  are  said 
to  meet  if  there  is  a  one  located  in  that  column  and 
that  row.  Of  course  a  graph  can  also  be  regarded 
visually  as  a  geometric  network.  It  is  often  helpful 
to  visualize  statements  on  matroids  for  the  case  of 
graphs,  though  it  can  be  misleading.  Matroids  do 
not  contain  objects  corresponding  to  nodes  or  rows. 

Theorem  1  on  “minimum  partitions,”  the  subject  of 
this  paper,  was  discovered  in  the  process  of  unifying 
results  described  in  the  next  paper,  “On  Lehman's 
Switching  Came  and  a  Theorem  of  Tutte  and  Nash- 
Williams"  (denoted  here  as  “Part  II”),  which  is  a  direct 
sequel.  Theorem  1  is  shown  there  to  be  closely  re¬ 
lated  to  those  results.  Lately,  I  have  learned  that 
Theorem  1  for  the  case  of  graphs  (se;  sec.  1.7)  was 
anticipated  by  Nash- Williams  [5J. 

By  borrowing  from  work  of  others,  I  intend  that  this 
paper  together  with  possible  sequels  be  partly  exposi¬ 
tory  and  technically  almost  self-contained. 

1.1.  Tha  Problam 

Various  aspects  of  matroids— in  particular,  the  first 
pair  of  axioms  we  cite— hold  intrinsic  interest  which 
is  quite  separate  from  linear  algebra. 

AXIOM  1:  Every  subset  of  an  independent  set  of 
elements  is  independent. 

Any  finite  collection  of  elements  and  family  of  so- 
called  independent  sets  of  these  elements  which  satis¬ 
fies  axiom  1  we  shall  call  an  independence  system. 
This  also  happens  to  be  the  definition  of  an  abstract 
simplicial  complex,  though  the  topology  of  complexes 
will  not  concern  us. 

It  is  easy  to  describe  implicitly  large  independence 
systems  which  are  apparently  very  unwieldy  to  an¬ 
alyze.  First  example:  given  a  graph  G,  define  an 
independent  set  of  nodes  in  G  to  be  such  that  no  edge 
of  G  meets  two  nodes  of  the  set.  Second  example: 
define  an  independent  set  of  edges  in  G  to  be  such  that 
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no  node  meets  two  edges  of  the  set.  Third  example: 
define  an  independent  set  of  edges  in  G  to  be  such  that 
the  edges  of  the  set,  as  column  vectors  of  N{G),  are 
linearly  independent.  The  third  example  is  the  pro¬ 
totype  of  the  systems  we  shall  study  here. 

A  minimum  coloring  of  the  nodes  of  a  graph  G  is  a 
partition  of  the  nodes  into  as  few  sets  (colors)  as  pos¬ 
sible  so  that  each  set  is  independent.  A  good  char¬ 
acterization  of  the  minimum  colorings  of  the  nodes  in 
a  graph  is  unknown  (unless  the  graph  is  bipartite,  i.e., 
the  nodes  can  be  colored  with  two  colors).  To  find 
one  would  undoubtedly  settle  the  “four  color" 
conjecture. 

A  problem  closely  related  to  minimum  coloring  is 
the  “packing  problem."  That  is  to  find  a  good  char¬ 
acterization  (and  an  algorithm)  for  maximum  cardinal¬ 
ity  independent  sets.  More  generally  the  “weighted 
packing  problem”  is,  where  each  element  of  the  system 
carries  a  real  numerical  weight,  to  characterize  the 
independent  sets  whose  weight-sums  are  maximum. 
The  packing  problem  for  the  systems  of  the  first 
example  is  also  very  much  unsolved  (unless  the  graph 
is  bipartite). 

The  minimum  coloring  problem  for  the  systems  of 
the  second  example  is  unsolved  (unless  the  graph  is 
bipartite).  Its  solution  would  also  undoubtedly  set¬ 
tle  the  four-color  conjecture.  However  the  packing 
problem,  and  more  generally  the  weighted  packing 
problem,  is  solved  for  the  second  example  by  the  ex¬ 
tensive  theory  of  “matchings  in  graphs." 

For  the  third  example  the  packing  problem  is  in  a 
sense  trivial.  It  is  well  known  that  the  system  of 
linearly  independent  sets  of  edges  in  a  graph,  and 
more  generally  the  system  of  linearly  independent 
sets  of  columns  in  a  matrix,  satisfies  the  following: 

AXIOM  2:  For  any  subset  A  of  the  elements,  all  maxi¬ 
mal  independent  sets  contained  in  A  contain  the  same 
number  of  elements. 

A  matroid  is  a  (finite)  system  of  elements  and  sets 
of  elements  which  satisfies  axioms  1  and  2. 

For  any  independence  system,  any  subsystem  con¬ 
sisting  of  a  subset  A  of  the  elements  and  all  of  the 
independent  sets  contained  in  A  is  an  independence 
system.  Thus,  a  matroid  is  an  independence  system 
where  the  packing  problem  is  postulated  to  be  trivial 
for  the  system  and  all  of  its  subsystems.  For  me,  hav¬ 
ing  spent  much  labor  on  packing  problems,  it  is 
pleasant  to  study  such  systems.  Matroids  have  a 
surprising  richness  of  structure,  as  even  the  special 
case  of  graph i<  matroids  shows. 

Clearly,  a  subsystem  of  a  matroid  M  is  a  matroid. 
We  call  it  a  submatroid  and  we  use  the  same  symbol 
to  denote  it  and  its  set  of  elements.  The  rank,  riA), 
of  a  set  A  »»t  elements  in  M  or  the  rank,  riA),  of  the 
submatroid  i  of  M  is  the  number  of  elements  in  each 
maximal  independent  set  contained  in  A,  i.e.,  the  num¬ 
ber  of  elements  in  a  base  of  4 

The  main  result  of  this  paper  is  a  solution  of  the 
minimum  coloring  problem  h>r  the  independent  sets 
of  a  matroid.  Another  paper  will  treat  the  weighted 
packing  piol  lent  tor  matroids. 


1 .2.  Ground  Hules 

One  is  tempted  to  suimise  that  a  minimum  coloring 
can  be  effected  for  a  system  by  some  simple  process 
like  extracting  a  maximal  independent  set  to  take  on 
the  first  color,  then  extracting  a  maximal  independent 
set  of  what  is  left  to  take  on  the  second  color,  and  so 
on  till  all  elements  are  colored.  This  is  usualk  tar 
from  being  successful  even  for  matroids,  though  it 
is  precisely  matroids  for  which  a  similar  sort  of  nmno- 
tonic  procedure  always  yields  a  maximum  cardinatit' 
independent  set  and,  as  we  shall  see,  in  another  paper, 
also  always  yields  a  maximum  weight  sum  indepei.de:  . 
set  when  the  elements  carry  arbitrary  real  weight:-. 

Consider  the  class  of  matroids  implicit  in  'he  chi' 
Mr  of  all  matrices  over  fields  ol  integers  modulo  pi  ime- 
(For  large  enough  prime,  this  class  include-  tin- 
matroid  of  any  matrix  over  the  rational  field.'  W 
seek  a  good  algorithm  for  partitioning  the  ,  •  —  in t,.- 
leicments  of  the  matroidl  of  any  one  of  tin-  'i:;i'-i 
(matroids)  into  as  few  sets  as  possible  so  that  <■:.<  h  >■■■ 
is  independent.  Of  course,  by  carrying  out  :h< 
tonic  coloring  procedure  described  above  in  ,ui  ...  -i! 
ways  for  a  given  matrix,  one  can  be  assured  of  nrm  •. 
tering  such  a  partition  for  the  matrix,  but  t:n  vou;<! 
entail  a  horrendous  amount  of  work.  We  sc:  k  an  a: 
gorithm  for  which  the  work  involved  inert  a -  oui 
algebraically  with  the  size  of  the  matrix  to  v  ,  h  n  i- 
applied,  where  we  regard  the  size  of  a  i,i.r  o  a  a 
creasing  only  linearly  with  the  number  ■>,  -i.iunns, 
the  number  ol  rows,  and  the  charade*  i.-tn-  a  ne  ti  . 
As  in  most  combinatorial  problems,  hmi.aa  u  unite 
algorithm  is  trivial  but  finding  an  algori:  u  which, 
meets  this  condition  for  practical  feasilu-it)  is  not 
trivial. 

We  seek  a  good  characterization  ol  'In-  minimum 
number  of  independent  sets  into  which  the  column.- 
of  a  matrix  of  Mr  can  be  partitioned.  As  1  to'  criterion 
of  “good"  for  the  characterization  we  apply  the  “prin¬ 
ciple  of  the  absolute  supervisor."  Tin  good  charac¬ 
terization  will  describe  certain  inlo, 'in.it ion  about  the 
matrix  which  the  supervisor  can  require  (us  assistant 
to  search  out  along  with  a  minimum  partition  and 
which  the  supervisor  can  then  use  with  ease  to  verify 
with  mathematical  certainty  that  the  partition  is  in¬ 
deed  minimum.  Having  a  good  characterization  does 
not  mean  necessarily  that  there  is  a  good  algorithm. 
The  assistant  might  have  to  kill  himself  with  work  to 
find  the  information  and  the  partition. 

Theorem  1  on  partitioning  matroids  provides  the 
good  characterization  in  the  case  of  matt  ices  of  Mr. 
The  proof  of  the  theorem  yields  a  good  algorithm  in 
the  case  of  matrices  of  My.  (We  will  not  elaborate  on 
how.)  The  theorem  and  the  proof  appb  as  well  to 
all  matroids.  via  the  matroid  axioms.  However,  the 
“goodness”  for  matrices  depends  on  being  able  to 
cariv  out  constructively  with  ea-"  those  matrix  opera¬ 
tion-  which  correspond  to  the  existential  asset:. oils 
ol  lie  theory.  A  fundamental  problem  of  matroid 
theory  is  to  find  a  good  representation  for  general 
matroids  — good  perhaps  relative  to  the  rank  and  t lie 
number  of  elements  in  the  matroids  There  is  a  very 
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elegant  lattice  representation  (geometric  lattices,  [1, 
2]),  bat  it  is  not  something  you  would  want  to  record 
except  for  the  very  simplest  matroids. 

1 .3.  The  Thsorsan 

The  cardinality  of  a  set  A  is  denoted  by  \A\.  The 
rank  of  a  set  A  is  denoted  by  r\A), 

Theorem  1:  The  elements  of  a  matroid  M  can  be 
partitioned  into  as  few  as  k  sets,  each  of  which  is  inde¬ 
pendent,  if  and  only  if  there  is  no  subset  A  of  elements 
of  M  for  which 


|A|  >  k  •  r( A). 

The  theorem  makes  sense  for  any  independence 
system  M  if  we  define  the  rank  r{A)  of  any  subset  A 
to  be  the  maximum  cardinality  of  an  independent  set 
in  A.  In  fact,  the  "only  if  part  of  the  theorem  is 
true  for  any  independence  system  M.  Let  /j(t  =  l, 
.  .  k)  be  k  independent  sets  in  M  for  which 

U  l  =M. 

i-i 

For  any  subset  A  of  M,  \hC\A\  «  r(A)  and 
J;  \i<nA\*k-«A). 

Thus  the  “only  if  part  is  proved. 

In  general  for  the  coloring  problem  in  nonmatroidal 
systems,  the  other  half  of  the  theorem  is  not  true. 
However,  the  Konig  theorem  on  matchings  in  bi¬ 
partite  graphs  can  be  regarded  as  a  valid  instance  of 
theorem  1  for  certain  nonmatroidal  systems.  A  bi¬ 
partite  graph  is  a  graph  whose  nodes  can  be  parti¬ 
tioned  into  two  sets  each  independent  (by  coincidence, 
an  instance  of  the  coloring  problem  in  our  first  ex¬ 
ample).  The  Konig  theorem  says  that  for  a  bipartite 
graph  G  the  minimum  number  of  nodes  which  meet 
all  the  edges  equals  the  maximum  number  of  edges 
such  that  no  node  meets  more  than  one  of  them. 
(This  theorem  solves  the  packing  problem  for  a  special 
case  of  our  second  example  of  independence  system.) 

Fourth  example:  For  a  graph  C,  let  the  elements  of 
the  system  M  be  the  edges  of  G.  For  each  node  of 
G,  let  the  set  of  edges  which  meet  the  node  be  an  in¬ 
dependent  set  in  M.  Let  the  subsets  of  these  sets 
be  the  rest  of  the  independent  sets  in  M.  The  Konig 
relation  for  a  graph  G  implies  theorem  1  for  system  M. 

Theorem  1  for  the  system  M  arising  from  G  does 
not  imply  the  Konig  theorem  for  G.  For  independence 
systems  in  general  the  relation  represented  by  theorem 
1  is  weaker  than  the  relation  represented  by  the  Konig 
theorem— the  latter  being  that  the  minimum  number 
of  independent  sets  which  together  contain  all  the 
elements  equals  the  maximum  number  of  elements 
in  a  set  of  rank  one.  It's  nice  to  have  the  weaker 
relation  of  theorem  1  because  it  might  apply  to  other 
systems  where  the  well  known  Konig  relation  does  not. 


1.4.  Terminology 


There  are  various  families,  (1)  through  (6),  of  subsets 
of  the  elements  in  a  matroid  M  which  are  used  in  de¬ 
scribing  the  structure  of  M. 

(1)  The  family  of  independent  set  of  M. 

(2)  The  family  of  minimal  dependent  sets  of  ele¬ 
ments  in  M  (where  dependent  means  not  independent). 
These  are  called  the  circuits  in  M.  The  letter  C  with 
various  sub  or  superscripts  will  be  used  to  denote  a 
circuit. 


(3)  The  family  of  spans  or  closed  sets  in  M.  A  span 
S  in  M  is  a  set  of  elements  such  that  no  circuit  of  M 
contains  exactly  one  element  not  in  S.  That  is, 
isncj  *  1  for  every  circuit  C  in  M. 

The  span  or  closure  of  a  subset  A  of  M  is  the  minima) 
span  in  M  which  contains  A.  Clearly,  the  span  of  A, 
which  we  always  denote  by  S(/f),  is  unique.  Where  A 
is  a  subset  of  column  vectors  in  a  matrix  M  of  column 
vectors,  S(/4)  is  all  the  columns  in  M  which  are  linear 
combinations  of  A. 


The  terms  above  are  used  extensively  in  section  i.5 
and  section  1.6  to  prove  theorem  1.  The  terms  below, 
through  (4)  and  (5).  are  used  extensively  in  Part  11. 

A  subset  A  of  M  is  said  to  span  a  subset  K  of  M  when 
KC  S(A).  It  follows  from  proposition  4,  to  come,  that 
A  spans  K  in  M  if  and  only  if  for  each  element  ecK 
either  eeA  or  their  is  a  circuit  C  of  M  such  that  eeC 
and  C  —  eCA. 

(4)  The  family  of  spanning  sets  of  M.  A  spanning 
subset  of  M  is  a  subset  of  M  which  spans  M— in  other 
words,  a  subset  of  M  whose  span  is  M. 

(5)  The  family  of  bases  of  M.  A  base  of  M  is  a 
maximal  independent  set  of  M.  A  base  can  also  be 
defined  as  a  minimal  spanning  set  of  M. 

The  terms  in  (1),  (2),  and  (5)  are  taken  from  Whitney 
[8].  The  terms  “closed  set”  and  "span  of  A"  are 
taken  from  Lehman  [3].  There  is  an  alternative 
terminology  due  to  Tutte  [7].  Since  these  are  major 
sources  on  matroids,  it  is  worthwhile  to  set  down  the 
relationship.  To  do  so  it  is  necessary  to  invoke  the 
much  used  notion  of  “dual  matroid,"  though  it  i«  not 
used  here  or  in  Part  II.  Papers  [3],  [7],  and  [8] 
show  that  the  set-complements  of  the  bases  n  a 
matroid  M  are  the  bases  of  a  so-called  dual  matroid  ivl*. 

The  bases  of  matroid  M  are  called  by  Tutte  the 
dt  ndroids  of  M.  The  elements  of  hi  are  called  by  Tutte 
the  cells  of  M.  The  independent  sets  of  M  are  called 

byten»'/t;5T  "iid  M  ««  »h«  Tutte  cell,  the 
atoms  of  dual  matroid  M*.  The  circuits  of  M*  are  the 
atoms  of  M.  Thus  here  is  another  special  family  of 
subsets  of  a  matroid  M. 

(6)  The  family  of  atoms  (dual  circuits)  in  M. 

The  rows  of  a  matrix  N «,  under  addition  and  sub¬ 
traction,  generate  a  group  of  row  vectors  which  Tutte 
calls  a  chain-group,  say  the  chain-group  N  of  matrix 
Nt.  The  matroid  M  of  matrix  A*  is  of  course  an  in- 
varient  of  chain-group  N,  and  it  is  what  Tutte  calls  the 
matroid  of  chain-group  N.  An  atom  of  M  of  Af  is  de¬ 
fined  as  a  set  of  elements  in  M  which  corresponds  to 
a  minimal  nonempty  set  of  row-vector  components 
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such  that  there  is  some  member  of  chain-group  V 
which  has  its  nonzero  values  in  precisely  these  coir 
ponents.  The  row-vectors  orthogonal  to  each  row  of 
matrix  .Ye  form  another  chain-group,  say  V*.  Its 
matron!  is  \t*,  the  dual  of  M.  Atoms  of  M *  by  defini¬ 
tion  correspond  to  minimal  dependent  sets  of  columns 
in  matrix  Vo.  That  is,  they  are  the  circuits  of  the 
matroid  M  of  Vo. 

Tutte  defines  a  flat  of  matroid  M  to  be  a  union  of 
atoms  of  M,  or  the  empty  set.  It  can  be  shown  that  a 
flat  of  M  is  the  set-complement  of  a  span  (closed  set) 
in  M,  and  conversely. 

Where  A  is  a  subset  of  elements  in  M,  Tutte  denotes 
by  M  •  A  what  here  is  called  the  submatroid  A  of  M 
(following  Whitney).  The  meanings  of  the  rank  riM) 
of  matroid  M  coincide,  and  Tutte  denotes  by  r{M  -  A) 
what  here  is  called  the  rank  r(A)  of  set  A  in  M  (follow¬ 
ing  Whitney).  However,  for  a  set  A,  what  Tutte  de¬ 
notes  by  r  A  is  not  r\A)  —  r(M  -A)  but  “r\M  XA)"  which 
is  used  in  Part  II. 

1 .5.  The  Lemmas 

In  the  proof  of  theorem  1  we  will  use  axiom  1  and 
the  following  axiom  2'  for  matroids  instead  cf  axioms 
1  and  2. 

AXIOM  2':  The  union  of  any  independent  set  and 
any  element  contains  at  most  one  circuit  (minimal 
dependent  set). 

Proposition  1:  Axioms  1  and  2'  are  equivalent  to 
axioms  1  and  2. 

PROOF:  Assuming  1  and  2,  suppose  independent  set 
I  together  with  element  e  contains  two  distinct  circuits 
Ci  and  Ct.  Assume  1  is  minimal  for  this  possibility. 
etCiC\Ci.  There  is  an  element  e,tC,  —  Ct  and  an  ele¬ 
ment  eytCi-C ,.  Set  / Ue-e.  — e,  is  independent 
since  otherwise  (/  —  ei)  is  a  smaller  independent  set 
than  /  for  which  (/  — eOUe  contains  more  than  one 
circuit.  Set  /  and  set  / Ue~ei  — ej  are  maximal  in¬ 
dependent  subsets  of  set  /  Ue.  This  contradicts 
axiom  2. 

Assuming  1  and  2',  suppose  I,  and  /*  are  both  maxi¬ 
mal  independent  subsets  of  a  set  >4  such  that  |/( |  <  |/*| . 
Assume  /iU/«  is  minimal  for  this  possibility.  There 
is  an  ei  in  /i-/i  and/fUet  is  dependent.  By  2\  /tile, 
contains  a  unique  circuit  C  which  must  contain  some 
element  et  not  in  I,.  Since  h  is  larger  than  I,  it  must 
contain  another  element  besides  e*  not  in  /i  and  hence 
/iU/i1- et  is  dependent.  Therefore,  since  /|Uei  —  ei 
is  independent,  there  is  some  I j  such  that  e,tl\Cl,  —  h 
and  such  that  =  e»  i»  maxims!  independent 

in  A.  Because  contains  an  element  not  in  h, 
|/i|  3*  )/j|  >  |/i|.  However,  since  /iU/J  is  a  proper  sub- 
set  of  I,  U/i,  this  contradicts  the  minimality  assumption 
for  l,  U/i.  The  proposition  is  proved. 

PROPOSITION  2:  Axioms  I  and  2'  are  equivalent  to 
the  following  axioms,  lf  and  2,.,  for  a  matroid  in  terms 
of  its  circuits  (where  starting  with  circuits,  independent 
sets  are  defined  as  sets  containing  no  circuits). 

AXIOM  !<■:  No  circuit  contains  another  circuit. 

AXIOM  2?:  If  distinct  circuits  C i  and  Ct  both  contain 
an  element  e  then  C,UC,  —  e  contains  a  circuit. 


A  proof  of  proposition  2  is  obvious. 

The  next  very  useful  proposition  is  taken  in  [7 j  and 
[8]  to  be  an  axiom  instead  of  2f.  Alfred  Lehman  dis 
covered  that  lc  and  2e  suffice. 

PROPOSITION  3:  If  C,  and  Cl  are  circuits  of  a  matroid 
M  with  an  element  eeC,DCt  and  an  element  aeC,  — 
Ct,  then  there  is  a  circuit  C  such  that 

aeCCC,UC,-e. 

PROOF  (Lehman):  Assuming  lr  and  2,-,  suppose 
Ci,  Ct,  a,  and  e  are  such  that  the  theorem  is  false  and 
CiUC-t  is  minimal.  There  is  a  circuit  C.iCCiUCj  —  e, 
but  aiCt-  There  is  an  element  ieCaDtCii  —  Ci).  By 
minimality  of  C1UC2  for  falsity  of  the  theorem  and 
since  aiCtilCt,  there  is  a  circuit  C«  such  that 
etCiCCtiiCs  —  b.  Again  by  the  minimality  and  since 
b^CiUCt,  there  is  a  circuit  C  such  that 

afCCCiUCi-eCCtUCt-e, 

contradicting  the  falsity  of  the  theorem. 

Proposition  4:  An  element  e  of  a  matroid  M  is  in 
the  span  S(A)  of  a  set  A  in  M  if  and  only  if  e  is  in  A  or 
there  is  a  circuit  C  of  M  for  which  C  —  A  =  e. 

Proof:  The  “if’  part  of  the  theorem  is  asserted 
in  the  definition  of  span.  Assuming  the  “only  if’ 
part  false,  by  the  definition  of  span  there  must  be  an 
A  and  etS(A)  —  A  for  which  there  is  no  C  with  C  —  A  —  e 
but  for  which  there  is  a  C  and  nonempty  E  with 
C-{AUE)  =  e  where  for  each  e'tE  there  is  a  C'  with 
C' —  A=e‘.  Assume  E  to  be  minimal  so  that  ECC. 
By  prop.  3,  for  any  e'  and  C'  there  is  a  Ci  for  which 
eeCiCCUC'  — e'.  Hence,  Ci—  (A UE,)  —  e  where  E, 
is  a  proper  subset  of  E,  contradicting  the  minimality 
of  E. 

Besides  axioms  1  and  2’  and  the  definitions  of  circuit 
and  span,  the  only  other  fact  on  matroids  used  to  prove 
theorem  1  is 

Proposition  5:  The  span  of  a  set  A  in  a  matroid 
M  is  the  (unique)  maximal  set  S  in  M  which  contains 
A  and  which  has  the  same  rank  as  A. 

In  particular  the  additional  fact  used  in  proving 
theorem  1  is  that  the  span  of  an  independent  set  /  has 
rank  equal  to  the  cardinality  of  /. 

Proof  of  PROP  5:  If,  for  S(/4)  the  span  of  A,  r(S(A )) 
>  r(A),  then  by  axiom  2  a  base  /  of  A  is  not  a  base  of 
S(A),  i.e.,  there  is  an  element  eeS(A)  —  /  such  that 
/ Ue  is  independent.  By  prop.  4,  e  is  not  in  the  span 
S(l)  of  /  but  A  is  in  Si/).  Since  the  span  of  a  set  is 
the  minimal  span  containing  the  set,  S(A)CS(l).  Thus, 
by  contradiction,  i]S(/4))  =  r(A). 

Let  etS'(A)  where  ACS'iA)  and  riA)  =  riS'(A)).  Then, 
where  /  is  a  base  of  A,  either  eel  or  eU /  is  dependent. 
Thus  eeS(A).  Therefore,  S(/<)  is  the  unique  maximal 
set  where  A  CS(A)  and  r(S(A))  =  r\A ). 

1 .6.  The  Main  Proof 

Proof  of  theorem  1  (the  “if”  part):  Assume  that  for 
every  subset  A  of  matroid  M,  \A\<  k  •  t\A).  Actually, 
it  is  sufficient  tiiat  for  every  span  S  in  M,  |S|«  k ■  /IS). 
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The  goal  is  to  get  all  the  elements  of  Xi  into  just  k  in¬ 
dependent  eets  of  M.  Let  F  be  a  family  of  k  mutually 
disjoint  independent  sets  of  M.  Any  number  of  these 
sets  may  be  empty.  These  sets  are  to  be  regarded  as 
labeled  so  that  each  may  be  altered  in  the  course  of 
the  proof  while  still  maintaining  its  label-identity. 
Suppose  there  is  an  element  x  of  M  such  that  U  {/ :  h.F) 
CM-x.  We  shall  see  how  to  rearrange  elements 
among  the  members  of  F  to  make  room  for  x  in  one  ol 
them  while  preserving  the  independence  (and  mutual 
disjointness)  of  them  all.  The  process  can  be  re¬ 
peated  until  each  element  of  M  is  in  a  member  of  F. 
Thus  the  theorem  will  be  proved. 

Implementing  this  proof  to  an  algorithm  for  par¬ 
titioning  (if  possible)  a  matroid  M  into  k  independent 
sets  is  quite  straight-forward  as  long  as  an  algorithm 
is  known  for  the  following:  for  any  ACM  and  ee\f, 
find  a  circuit  C  such  that  eeCCAUe  or  else  determine 
that  there  is  none.  In  the  algorithm  for  partitioning 
M,  one  of  course  would  not  first  verify  \A  |  «  k  ■  riA)  for 
all  AQM,  but  would  simply  proceed  on  the  assumption 
that  it  is  true  a.jd  then  stop  if  a  contradiction  arises. 

If  every  member  of  F  contained  as  many  as  r\M) 
elements,  then  since  they  are  disjoint  and  do  not 
contain  x ,  the  union  of  all  k  of  them  together  with  x, 
which  is  a  subset  of  M,  would  have  cardinality 
greater  than  k  ■  r\M).  However,  |.W|<  k  ■  r(M).  Hence 
there  is  an  l,fF  for  which  j/i|<r(Af).  Similarly, 
xeSi^Sdt)  implies  that  there  is  an  IttF  for  which 
1/jflSil  <r(Si),  since  if  each  member  of  F  had  riSj 
elements  in  Si,  then  their  union  together  with  x 
would  be  more  than  k-tiS,)  elements  in  Si,  but  |St  j  « 
k-r(Si). 

Denoting  M  by  So,  then  likewise  in  general 
xiS,=SU<ns,-)) 

implies  that  there  is  an  h*ttF  for  which  j/i+i flSjJ  < 
rfSi),  since  |Si|  <  k  •  ASi).  These  // s  are  not  neces¬ 
sarily  distinct  members  of  F. 

Where 

s,+,*S(/,,,ns,). 

we  have 

KSi.i)  <  riSi). 

Since  rank  is  a  nonnegaiive  integer,  we  must  eventually 
reach  an  integer  h  for  which 

x#s*=S(/»ns».  i) 

and 

xtSi  for  i  =*  1 ,  .  .  . ,  h  -  1 . 

By  construction,  S,DStD  .  .  .  DS». 

If  /«Ui  is  independent  then  replacing  /*  by  /*L fat 
disposes  of  x.  Otherwise  there  is  •  unique  circuit 
CC/*Ux.  Since  C  —  xCSn-i  would  imply  x«5» 
*S(/*nS»- 1),  there  is  an  xteC-x  such  tha;  xifS*  t. 


We  replace  /*  in  F  by  independent  set  hUx  —  x>. 
The  new  family  is  still  called  F  and  the  new  set 
carries  the  label-identity  in  F  which  /*  had.  This 
and  the  following  informal  conventions  are  used  simply 
to  avoid  introducing  a  lot  more  indices.  Any  other 
/ i  which  was  the  same  member  of  F  as  /*  is  now 
IhUx  —  xi.  We  will  distinguish  between  the  current 
/ 1  and  the  original  It.  The  Si’s  do  not  change. 

We  have  disposed  of  x  and  now  we  must  find  a 
place  for  xi  in  some  member  of  F.  Since  x^Sx-i 
and  xtfSi >,  and  since  the  Si’s  are  monotonically  netted, 
there  is  some  index  i(l)  <  h  —  1  for  which 

■*i€Skd  and  XieSin)-|. 

Denote  h  by  i(0)  and  denote  x  by  Xq.  Assume  induc¬ 
tively  that  xoiSmu  *oeS«0>-i,  Xi^Sko,  *ieS«i>-i,  .  . 
Xj4S,(j).  XjfSuj,  - 1,  where  t(0)>i(l)>.  .  .  >  i(J).  As¬ 
sume  further  that  /«o)  was  replaced  in  F  by  /wUxii 
—  Xu  then  Itat  was  replaced  in  F  hy  /kuUxi— *i,  .  .  ., 
and  then  Itq-t,  was  replaced  in  F  by  /iy-i)Uxj-i—  xy, 
where  XitCoC/woiUx*,  XtfC ,C/ni>Ux,,  .  .  and 
Xj(Cj- 1 C  laj- 1 )  U  Xj~  i . 

Suppose  there  is  a  circuit  CjC/iyiUxj.  Set  ly, 
might  have  the  same  label-identity  in  F  as  /«,>  for 
several  values  of  q  <  j,  and  so  the  contents  of  /n»  may 
have  changed  several  times  since  the  original  Ihj,  which 
gave  rise  to  Siu)  =  S(/iy>nSio;  ().  In  particular,  x„ 
for  some  q  <  j  may  have  been  adjoined  to  Ity,-  How¬ 
ever,  by  the  induction  hypothesis  any  such  x,  is 
contained  in  Sk9)-  i  and  thus  in  Sty 

Therefore  all  elements  of  Cj  —  Xi  which  are  not  in 
the  original  Ito)  are  in  S«o>-  By  definition  of  Smj>,  all 
elements  of  the  original  I*j)  which  are  in  Shj)  - 1  are  also 
in  Skj).  Thus  if  all  elements  of  Cj  —  xj  ere  in  S«j> _  i  then 
they  are  all  in  Sio>.  but  since  is  a  span  then  xj  also 
would  be  in  Sio>,  contradicting  the  inductive  hypothesis. 
Hence,  there  exists  some  element  xj*  i  of  Cj  such  that 
xj  *  ifSttj)  - 1 .  Since  xj  ♦  i«S»,  there  is  tome  i(j  + 1)  <  i(j) 
such  that  Xj ♦  \fSnj *  ,,  and  xj*  itSty  *  n_  i. 

Therefore  when  there  exists  a  Cj,  we  repeat  the  in¬ 
ductive  step  by  replacing  In,,  by  ho,  U  xj-xj  +  i. 

Since  i(0)  >  i'(  1  >  >  .  .  .,  eventually  we  must  reach  an 
«ty)  f°r  which  there  is  no  CjC/kj,Uxj  Then  we  can 
replace  /iy>  in  F  by  independent  set  Ihj,  U  Xj  without 
having  to  displace  another  element  aj*i.  End  of 
proof. 

1.7.  Corollary 

For  the  special  case  wherr  M  is  the  matroid  of  a 
graph  C,  theorem  1  can  be  simplified  somewhat: 

Corollary  (Nash-Williams  [5]):  The  edges  of  a 
graph  G  can  be  colored  with  as  few  as  k  colors  so  that 
no  circuit  of  G  is  all  one  color,  if  and  only  if  there  is  no 
subset  U  of  nodes  in  G  such  that,  where  Eu  is  the  set  of 
edges  in  G  which  have  both  ends  in  U, 

|EJ  >k(|U|  — 1). 

Symbols  \U\  and  |£V|  denote,  respectively,  the 
caidinalities  of  (/  and  £(. 
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Not  every  subset  A  of  elements  in  the  matroid 
MiG)  of  G,  nor  even  every  closed  set  A  of  elements  in 
MiG),  corresponds  to  a  set  of  edges  of  type  EL>.  How¬ 
ever,  the  relation  \A  |  £  k  •  riA)  for  every  set  A  corre¬ 
sponding  to  a  set  Ev  of  edges  which  form  a  connected 
subgraph  of  G  implies  the  relation  for  every  subset  A 
of  elements  in  MiG). 

The  corollary  follows  (we  omit  the  proof)  from 
theorem  1  by  using  the  following  characterization  of 
the  rank  function  of  a  graph  due  to  Whitney: 

The  rank  riE)  of  any  subset  E  of  edges  in  G,  i.e.,  the 
rank  of  the  matroid  subset  corresponding  to  £',  equals 
the  number  of  nodes  minus  the  number  of  connected 
components  in  the  subgraph,  G  •  E,  consisting  of  the 
edges  E  and  the  nodes  they  meet,  or  equivalently  in  the 
-  airgraph,  G  :  F,  consisting  of  the  edges  F.  and  all  the 
nodes  of  G.  The  notation  G  ■  F.  and  G :  F  is  due  to 
Tutlc,  chapter  III  of  [7] . 
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Lehman's  Switching  Game  and  a  Theorem  of 
Tutte  and  Nash-Williams1 

lack  Edmonds 

(Drcrmber  1,  1964) 

Thr  rrsuli#  cited  in  the  title  are  unified  by  the  following  theorem:  For  any  matroid  M  and  any 
aiibsrts  N  and  K  of  elements  in  M,  there  exist  as  many  as  k  disjoint  subsets  of  N  which  span  K  and 
which  span  each  other  if  and  only  if  there  is  no  contraction  matroid  MxA  where NC\A  partitions  into 
as  few  as  k  sets  such  that  each  is  independent  in  M  XA  and  such  that  at  least  one  of  them  does  not 
span  AO  A  in  M  *  A. 


2.1.  The  Problem  The  purpose  of  the  present  note  is  to  unify  these  two 

theories.  Theorem  2  states  the  straightforward  gen- 
A.  Lehman  [3] 2  posed  the  following  game  to  be  eralization  to  matroids  of  the  Tutte  and  Nash-Williams 
played  between  two  players  on  any  given  matroid  M  theorem.  Theorem  3  is  Lehman’s  main  theorem  char- 

with  a  distinguished  element  e.  The  players  are  called  acterizing  short  games.  Theorem  4  is  an  analogous 

the  cut  player  and  the  short  player.  They  take  turns  iheorem  characterizing  nonshort  games.  (Lehman 

and  (to  be  explicit)  the  cut  player  goes  first.  Each  characterizes  nonshort  games  indirectly  by  using  “dual 

player  in  his  turn  tags  an  element  of  M,  other  than  e,  matroids”  which  we  avoid.)  Theorem  5,  for  the  case 

not  already  tagged.  The  short  player  wins  if  he  tags  where  K  —  N  —  M,  yields  theorem  2.  For  the  case 

a  set  of  elements  which  span  e.  The  cut  player  wins  where  k~  2,  it  yields  the  “only  if’  parts  of  theorems  3 

otherwise -that  is,  the  cut  player  wins  if  the  elements,  and  4.  The  “if”  parts  of  theorems  3  and  4  are  proved 

other  than  e,  which  he  has  not  tagged  d  not  span  e.  by  describing  the  winning  strategies  when  the  respec- 

The  game,  determined  by  M  and  e,  is  called  a  short  tive  conditions  hold  (in  one  case  this  follows  Lehman, 

game  if  the  short  player  can  win  against  any  strategy  [3]). 

of  the  cut  player.  We  will  call  the  game  nonshort  if  Theorem  1  in  section  1.3  and  theorem  2  are  in  a  sense 
the  cut  player  can  win  again*:  any  strategy  of  the  short  dual  to  each  other  but  not  in  the  usual  matroid  sense, 

player.  Clearly  a  game  is  one  or  the  other.  For  any  Each  can  be  proved  from  the  other.  We  use  theorem  1 

M  and  e,  Lehman  characterizes  short  games  and  de-  to  prove  theorem  S. 

scribes  a  winning  strategy  for  the  short  player.  Theorem  5  appears  interesting  in  itself.  We  call  it 

Recall  from  section  1.4  that  a  set  Tin  a  matroid  4/ is  the  "cospanning-set  theorem”  after  a  main  idea  of 
said  to  span  a  set  A  in  M  if  for  every  re  A,  either  eeT  or  Lehman’s  theory.  For  a  graph  G  with  a  prescribed 

there  is  a  circuit  C  of  M  such  the:  C  —  eCT.  Recall  subset  of  nodes  called  terminals,  it  gives  a  “good" 

that  a  base  B  of  M  is  a  set  which  spans  4f  (o.g.,  a  span-  characterization  for  the  nonexistence  of  k  edge-wise 

ning  set  of  \t)  and  which  also  is  independent.  disjoint  connected  subgraphs  (e.g.,  trees/,  aD  with 

where  the  game  is  played  on  a  graph  G,  it  is  not  precisely  the  same  set  of  nodes  which  includec  the 

necessary  to  have  an  edge  corresponding  to  e  but  terminals. 

sufficient  to  have  two  distinguished  “terminal"  nodes.  If  the  matroid  4/  of  the  cospanning-set  theorem  is  a 
t'i  and  vt,  which  would  be  the  ends  of  e.  Here,  the  goal  finite  set  of  vectors  in  a  space  L,  then  for  given  subsets 

of  the  short  player  is  to  tag  a  set  of  edges  which  con-  V  and  K  of  4#,  the  theorem  provide*  a  “good"  charac- 

tains  a  path  of  edges  joining  t>i  to  it.  The  goal  of  the  terization  for  the  nonexiatence  of  aa  many  as  k  dis¬ 
cut  player  is  to  tag  a  set  of  edges  which  separates  t  (  joint  subsets  of  /V  and  a  aubspace  V  of  L  such  that 

from  tt-  each  Nt  exactly  span*  V  and  such  that  V  contains  K. 

A  theorem  due  independently  to  Tutte  [6]  and  Nash- 
Williams  [4}  characterizes  for  any  graph  G  the  maxi-  2.2.  Contractions 

mum  number  of  edge-wise  disjoint  subgraphs,  each 

connected  and  containing  all  nodes  of  G,  into  which  We  use  (he  following  inportant  concept  on  matroids 
the  edges  of  C  can  be  partitioned.  For  a  connected  due  t0  Tutte  (ch  n  0f  [7)).  For  any  set  A  of  elements 

graph  C.  the  edges  of  a  connected  subgraph  which  con-  in  ,  mifroid  W  define  ,he  circuil,  0f  M  X  A  to  be  the 

tain*  every  node  of  G  correspond  to  the  elements  of  a  mjnima)  nonempty  intersection*  of  A  with  circuits  of  M. 
spanning  set  of  the  matroid  of  G,  and  conversely.  Proposition  6.  The  set  of  elements  A  and  the  cir- 

cuits  of  M  X  A  are  a  matroid  ( denoted  by  M  X  A),  called 
the  contraction  of  M  to  A. 

iw Proof:  Axioms  lf  and  2?  for  M*A  follow  immedi- 

M—TtiPM  "fwrf  *  ,  1  £  k£ 

1  t*  beach**  imIkM*  tV  Veratar*  rtieremce*  an  pag*  •tely  trom  prop.  3  tor  M. 


1  TV«  paper  n  t  ia  ike  •me.  “MtftttiMiw  Paeiiuaat  of  a  1 

hdrpemtrm  Sakaet*  "  TV  MtwWnnf  *r*ie<a  there.  ,»rW4*aa  reference *. 
here  Tkta  work  *h  **t#poete^  by  (kr  Army  Research  Oflkre  oVHvajo ' 
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COROLLARY:  W  here  A  and  A  are  complementary  sub¬ 
sets  of  matrvid  M,  A  Is  closed  ( a  span)  in  M  if  and  only 
if  matroid  M  x  A  contains  no  “loops,"  that  is  elements 
of  rank  zero. 

PROPOSITION  7:  Where  K  and  A  are  subsets  oj 
matroid  M,  subset  T'  of  A  spans  KOAinNlxAi/  and 
only  if  there  is  a  subset  T  of  M  such  that  T'  =  Tfl  A  and 
such  that  T  spans  K  in  M. 

COROLLARY:  The  spanning  sets  of  matroid  M  x  A 
are  precisely  the  intersections  of  A  with  spanning  sets 
ofM. 

PROOF  OF  prop.  7:  Suppose  T’  =  TC\A  where  T 
spans  A  in  M.  Since  T  spans  A,  for  any  element  e  in 
KDA,  either  eeT  or  there  is  a  circuit  C  in  M  such  that 
C-eCT.  If  etT  then  eeT'  and  hence  7”  spans  e  in 
My- A.  If  there  is  a  C  then,  by  definition  of  My  A, 
there  is  a  circuit  C'  of  My  A  such  that  eeC'CC.  It 
follows  that  C'  —eCT'  and  hence  T  spans  e  in  M  X  A. 
Thus,  the  “if’  part  is  proved. 

Suppose  subset  T'  of  A  spans  AH  ,4  in  M  y  A.  Let 
r=  r  U A  where  A  is  the  complement  of  A  in  M.  Then 
f  —  TOA.  Let  e  be  any  element  of  A.  II  eeT,  then 
T  spans  e  in  M.  Otherwise,  e«  AO  A,  ande{T'.  Since 
T  spans  e  in  My  A,  there  is  a  circuit  C'  of  M  X  A  such 
that  eeC'  and  C'  —  edT'.  By  definition  of  My  A, 
there  is  a  circuit  C  of  M  such  that  C'  —CO  A.  There¬ 
fore,  T  spans  e,  since  etC  and  C—eCT.  Thus,  the 
“only  if’  part  is  proved. 

Tutte  uses  M  A  to  denote  what  we  mean  by  the 
submatroid  A  of  M;  he  does  not  follow  Whitney’s  in 
,'ormality  of  letting  A  mean  both  a  matroid  and  its  set 
of  elements.  We  will  use  Tutte’s  notation  and  also, 
where  convenient,  we  will  depart  from  it  again  by 
referring  to  My  A  simply  as  the  contraction  matroid. 
A,  of  M  just  as  we  refer  to  M  •  A  as  the  submatroid, 
A,  of  M.  Also,  A  denotes  the  elements  of  either. 

Where  MiG)  is  the  matroid  of  graph  G,  the  matroid 
of  a  subgraph  H  of  a  graph  G  is  the  submatroid  of  MiG) 
which  contains  the  elements  corresponding  to  the 
edges  of  H  and  conversely.  The  matroid  of  a  “con¬ 
traction  graph”  H  of  G  is  the  contraction  of  WlG)  which 
contains  the  elements  corresponding  to  the  edges  of 
H,  and  conversely. 

The  most  instructive  way  to  describe  the  meaning 
of  contraction  graph  is  visuallv.  The  contraction 
graph  H  of  G  whose  edges  are  the  »et  H  ol  edges  in 
G  is  the  graph  obtained  from  C  by  contracting  to  a 
point  each  edge  of  C  not  in  H. 

It  should  be  pointed  out  that  in  order  for  there  to  be 
a  contraction  n  of  G  for  every  subset  H  of  edges  in  G, 
we  must  extend  our  meaning  of  graph  tin  sec.  I  ll  to 
graphs  which  include  edges  which  "meet  the  same 
node  at  both  ends."  These  "loop"  edges  are  circuits 
by  themselves;  they  correspond  to  matroid  elements 
which  are  not  contained  in  any  independent  set  of  the 
matroid.  This  sort  of  matroid  element  corresponds  in 
a  matrix  to  a  column  of  all  zeros.  In  a  matrix  AlG),  a 
loop  of  graph  G  can  be  represented  by  a  column  ol  NtGt 
which  contains  a  2  in  the  row  corresponding  to  the 
node  met  and  which  contains  zeros  elsewhere.  Hela 
live  to  the  matroid  structure,  the  column  is  all  zeros, 
mod.  2. 


We  have  pointed  out  how  any  contraction  of  the 
matroid  of  a  graph  can  be  represented  as  the  matroid 
of  a  graph.  It  is  also  possible  to  represent  any  con¬ 
traction  of  the  matroid  of  a  matrix  as  the  matroid  of 
a  matrix. 

By  deleting  (or  cutting)  set  of  elements  A  in  matroid 
M,  we  mean  replacing  matroid  M  by  its  submatroid  on 
the  set  M  —  A.  By  contracting  (or  shorting)  set  of 
elements  A  in  matroid  M,  we  mean  replying  matroid 
M  by  its  contraction  to  the  set  M  —  A.  Clearly,  from 
the  definition  of  submatroid,  we  can  get  a  submatroid 
M'  of  M  by  deleting  the  elements  of  M  not  in  M'  one 
after  another  in  any  order.  Clearly,  from  the  corollary 
to  prop.  7.  we  can  g  *  a  contraction  matroid  M'  of  M  by 
contracting  the  elements  of  M  not  in  M'  one  after  an¬ 
other  in  any  order.  It  can  be  proved  that  for  any 
elements  a  and  b  in  a  matroid  \l,  deleting  u  and  then 
contracting  b  is  the  same  as  contracting  b  and  then 
deleting  a.  The  proof  is  omitted.  These  results  can 
be  summarized  by  the  following: 

Proposition  8:  The  operations  of  deleting  certain 
elements  together  with  the  operations  of  contracting 
certain  other  elements  in  a  matroid  are  associative 
and  commutative. 

The  above  proposition  is  equivalent  to  Tutte’s  iden¬ 
tities  3.33  in  [7],  Tutte  defines  a  minor  of  a  matroid 
M  to  be  any  matroid  obtained  from  M  by  deleting  cer¬ 
tain  elements  and  contracting  certain  other  elements 
in  V. 

The  following  theorem  is  presented  by  Tutte  (theo¬ 
rem  3.53  of  [7]  I  in  terms  of  “dendroids.” 

PROPOSITION  9:  //  A  and  X  are  complementary  sets 
of  elements  in  matroid  M,  then  the  elements  in  a  base 
of  M  X  A  together  with  the  elements  in  a  base  of  M  ■  A 
are  the  elements  in  a  base  of  M. 

Proof  omitted. 

Corollary:  rtM  •  Al-friM  x  Xi  =  hM). 

W  e  have  been  calling  riM  •  ,4)  the  rank  riA I  of  set  A 
in  matroid  M.  We  denote  rt.V/x,4)  as  function  ft.4> 
of  sets  A  in  matroid  \1. 

The  following  theorem,  which  for  the  case  of  con¬ 
nected  graphs  is  the  one  due  to  Tuttr  and  Nash- 
Williams,  completely  parallels  theorem  1.  The  "it" 
part  of  theorem  2  follows  immediately  Iroin  the  "il” 
part  of  theorem  5  (where  M  —  V  =  A). 

Theorem  2'  The  elements  of  a  matroid  M  can  be 
partitioned  into  as  mans  as  k  sets,  each  a  spanning  set 
of  M,  tf  and  onls  1 1  there  is  no  subset  A  of  elements  oj 
V  fat  which 

| A|  *-  k  ■  tl\). 

Any  contraction  graph  of  a  connected  graph  is  con¬ 
nected.  I'smg  the  last  paragraph  ol  1.7,  observe  th.it 
where  M  is  the  mat  mid  of  a  connected  graph  G,  ft  -#  * 
is  the  nrinber  of  nodes  minus  one  o|  a  contraction 
graph  of  G,  and  j.4|  is  the  number  of  edges  in  that 
contraction  graph. 

Notice  that,  since  nA)  =  n -If)  -  nil,  theorem  2  is 
ea«il>  staled  without  the  notion  of  contraction. 

To  prove  the  "only  if’  part  of  theorem  2.  assume  that 
If  partitions  into  k  sets,  each  spanning  M.  By  taking 
a  subset  of  each  of  them,  we  get  disjoint  bases 
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Biii  =  1,  .  .  A).  Let  A  be  any  subset  of  M  and  ■  t  ,7 

be  its  complement.  Since  B,  is  independent,  rt/L 
*|7nfl,|.  Since  Bt  is  a  base,  \A nfl,|+|.?nZ?,j=  |fi,| 
=  riM).  Combining  the  two  gives  \A  C\B,\  » r{M) 
—  r(J)=t(A).  Therefore  \A\  »  ii|dnfi(|  *  k  •  t(A). 

2.3.  Short  Gamas 

It  turns  out  to  be  just  as  easy  to  analyze  games  where, 
for  the  graph  case,  any  subset  of  nodes  of  G  are  dis¬ 
tinguished  as  terminals  and  the  goal  of  the  short  player 
is  to  tag  a  set  of  edges  in  G  which  contains  the  edges  of 
a  connected  subgraph  containing  all  the  terminals.  To 
interpret  this  game  in  matroid  terms,  adjoin  to  G  a  set 
of  new  edges  which  form  a  connected  graph  k  contain¬ 
ing  precisely  the  terminals  as  nodes.  Then  relative 
to  the  matroid  of  graph  G Uff,  the  goal  of  the  short 
player  is  to  tag  a  set  of  elements  corresponding  to 
edges  in  G  which  spans  the  set  of  elements  correspond¬ 
ing  to  edges  in  k. 

For  any  matroid  M  and  nonempty  subsets  .V  and  K, 
consider  the  game  L(M,  V,  K)  where,  as  before,  the  cut 
player  and  short  player  take  turns  tagging  different 
elements  of  ,Y,  the  cut  player  going  first.  The  short 
player  wins  if  he  tags  a  set  of  elements  which  span  k. 
Otherwise,  the  cut  player  wins.  Call  IaM,  V,  k)  a 
short  game  if  the  short  player  can  win  against  any  strat¬ 
egy  of  the  cut  player. 

Lehman's  main  theorem  (explicitly  for  the  case 
where  k  is  a  single  element)  is 

THEOREM  3:  UM ,  N,  k)  is  a  $h<  rt  game  if  and  only 
if  N  contains  two  disjoint  sets,  Ac  and  B0,  of  elements 
which  span  each  other  and  which  span  k. 

Notice  that  in  the  two-terminal  graph  case,  (he  short 
player  wants  to  get  a  path  joining  the  terminals.  The 
structure  characterizing  when  he  can  is  two  edgewise 
disjoint  trees  each  containing  the  terminals  and  each 
containing  precisely  the  same  nodes  as  the  other. 

Lehman  calls  two  (or  more)  sets  which  span  each 
other  cospanning.  Let  us  verify  that  two  disjoint  co¬ 
spanning  sets  A»  and  B»  in  ,V  which  span  k  provide  a 
winning  strategy  in  the  game  Li M,  V,  k)  for  the  short 
player.  All  that  we  need  consider  is  the  span 
mS(Be)  in  M.  Clearly,  we  can  take  Ac  and  Bt  to  be 
bases  of  submatroid  M$:  assume  that  they  are.  If  the 
cut  player  tags  an  element  not  in  .7«Ufi*.  we  can  pre¬ 
tend  that  at  the  same  time  he  also  tags  some  element 
of  AtUBt.  Clearly,  the  short  player  would  not  l>e  tak 
ing  an  illegal  advantage  by  pretending  this.  Therefore, 
suppose  the  cut  player  in  his  first  turn  tags  element 
a*  in  A$. 

By  axiom  2  (in  1.1)  there  is  an  element  6»of  B»  such 
that  (A$  —  OolUfi*  is  a  base  of  .We-  The  short  player 
should  tag  an  element  b%  It  follows  from  prop.  7  that 
disjoint  sets  A,mA$~at  and  B,  **  B,  —  bt  are  spanning 
sets  of  the  contraction  matroid  Mi  mM*~  b %  of  M». 

Since  it  is  the  cut  player's  turn  again,  the  situation 
of  A  and  B i  relative  to  M ,  is  as  it  was  for  A»  and  Be 
rela  e  to  W»  except  that  Mi  is  smaller.  Assuming 
ther.  is  a  strategy  for  the  succeeding  turns  whereby 
the  short  player  can  tag  a  set  of  elements  which  con¬ 
tains  s  b  i»<-  T  of  reduced  matroid  Mi.  then  by  prop  9 


the  .se!  TUbo  of  elements,  which  the  short  player  will 
have  tagged,  is  a  base  of  matroid  M»  and  hence  spans 
set  k  in  matroid  M. 

When  B0  contains  only  one  element  bo.  then  b»  itself 
spans  Mo  and  k.  Hence,  by  induction  on  the  number 
of  elements,  we  have  a  winning  strategy  for  the  short 
player.  This  proves  the  “if  part  of  iheoirem  3.  The 
harder  "only  if’  part  will  follow  from  theorem  4  and 
theorem  5. 

2.4.  Nonahort  Games 

The  notion  of  contraction  can  always  be  used  in 
place  of  the  more  familiar  notion  cf  "matroid  duality," 
and  conversely,  because  of  a  theorem  (3.27  of  [7])  re¬ 
lating  the  contraction  matroids  of  an  M  to  the  sub- 
matroids  of  the  “dual  to  W."  Sometimes  one  notion  is 
convenient,  sometimes  the  other.  Wed.  not  use  dual 
ity  here  Lehman  in  treating  the  same  topic  uses 
mainly  duality. 

Lehman's  interpretation  of  his  dual  results  charac¬ 
terizing  when  the  cut  player  can  win  for  the  case  of 
graphs  does  not  directly  provide  a  “good”  character¬ 
ization  in  the  sense  of  the  absolute  supervisor.  Clearly 
his  characterization  of  a  short  game  is  good  in  the  case 
of  graphs.  However,  he  does  not  give  the  following 
analogous  characterization  for  nonshort  games.  (Com¬ 
pare  Lehman’s  theorem  (26)  and  its  graph  interpreta¬ 
tion  with  our  theorem  4  and  its  contraction  graph  inter¬ 
pretation.  See  also  the  comment  on  his  theorem  (26) 
which  follows  his  theorem  (29).) 

THEOREM  4:  I  |M,  N,  K)  is  a  non-short  game  if  and 
only  if  there  is  a  contraction  matroid  M’  of  matroid  M 
where  set  V  =“  NOM'  can  be  partitioned  into  two  sets 
1,  and  I,  such  that  L  and  l«  are  both  independent  in  M1 
and  such  that  L  does  not  span  the  set  K’  -*  Kfl  M’  in  M'. 

Let  us  verify  that  an  M',  h.  and  lt  provide  a  winning 
strategy  for  the  cut  player  in  game  L(W,  V,  A).  If  / , 
does  not  span  K'  in  W'  then  the  rut  player  can  tag  any¬ 
thing  on  his  first  turn.  Otherwise,  he  should  tag  an 
element  ri  in  h  such  that  L  —  e(  does  noi  span  K'  in  M'. 

Since  h  does  not  span  K',  there  is  an  element  e*k' 
such  that  rie)*0.  If  ee/i,  then  e  is  an  element  et. 
Otherwise,  by  axiom  2'  there  is  a  unique  circuit  C  in 
/iUr  and  so  any  element  of  C  —  e  is  an  element  et. 
iC  —  e  is  not  empty  since  r\e\  w  0.)  Now  neither  the 
untagged  elements  —  et  of  L  nor  the  untagged 

elements  /j  *  /t  of  It  'pan  K'  in  contraction  ma  roid  W'. 

Even  if  the  short  player  tags  an  element  not  in  W', 
clearly  the  rut  player  is  not  taking  an  illegal  advantage 
by  pre'ending  the  short  player  also  tags  an  untagged 
element  it.  M'  if  there  are  any.  Therefore,  assume  the 
short  player  does  tag  one.  say  e,  in  /'.  Consider  the 
contraction  matroid  V*  “  .V  —  et  of  .ft.  (By  prop.  8. 
the  contraction  of  matroid  M’  to  set  V  is  the  same  as 
the  con.raction  of  matroid  M  to  set  ht.)  By  the  cir¬ 
cuit  definition  of  contraction  matroid,  art  /:■»/' -e, 
will  be  independent  in  W*  and  will  not  span  K^K'CtkT 
in  W* 

Again  by  the  definition  of  contraction  matroid,  if  e» 
is  not  in  the  span  of  !\  in  M'  then  /[  is  independent  in 
M In  this  case,  the  cut  player  should  tag  some  ele- 
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ment  ej  such  that  —  e J  dues  not  span  K ’  in  Af". 

By  the  definition  of  contraction  matroid  and  prop.  3, 
if  Ci  is  in  the  span  of  l[  in  matroid  V/’  then  set  1\  con 
tains  just  one  circuit  of  contraction  .Af"  and  does  not 
span  K"  in  \T .  in  this  case,  the  cut  player  should  tag 
some  element  t\  in  the  one  circuit  of  /j  in  M ",  so  that 
—  e\  is  independent  in  AT. 

Thus  in  either  case,  after  the  cut  player  takes  his  sec 
ond  turn,  the  untagged  elements  of  AT  partition  into 
sets  r,  and  1^  where,  in  contraction  .Af",  both  are  inde¬ 
pendent  and  neither  spans  A.’’.  There  are  n*>  elements 
tagged  by  the  short  player  in  Af.  The  situation  is 
identical  to  ‘he  one  in  Af  right  after  the  cut  player  took 
his  first  turn  except  that  Af  has  fewer  untagged 
elements 

Hence,  by  induction  on  the  number  ot  untagged  ele¬ 
ments  in  ihe  contraction  matroid,  if  the  cut  player  tags 
as  described,  be  eventually  reaches  a  contraction 
matroid  Af*1  in  winch  all  the  elcitiecs  are  tagged  by 
him,  and  yet  for  w  hich  there  i'  an  rtk'1"  ~  \th'C\K 
such  that  /1e|  »*  0  in  V*1  (since  k1"  is  not  spanned  by 
the  empty  f*1  or  the  empty  f,*' I.  The  cut  plaver  will 
then  have  won  the  game,  because  for  the  short  player 
to  win  he  must  tag  a  set.  say  T,  which  spans  k  in 
matroid  Af.  By  prop.  7,  for  any  such  T  and  any  set 
AT*'  in  Af,  Tnw*1  must  span  K'k'  —  KO  AT*'  in  matroid 
Af*',  which  is  impossible.  This  proves  the  "if’  part 
of  theorem  4 

2.S.  Coapanning-Swta  Tbsoratn 

We  still  have  to  prove  the  “only  if  parts  of  theorems 
3  and  4.  They  follow  immediatrlv  from  theorem  5 
(for  the  case  A«2).  We  proved  the  part  of  theorem  3 
which  says  "P  (i  is  a  thort  game)"  W'e  proved  the 
part  of  thecrem  4  which  says  “ Q  •+  (I  is  not  a  short 
cune)''.  Theorem  5  says  "r  H  not  </'  Logic  yields 
that  **( L  is  a  short  game)  ■eP*’  and  “<L  is  not  a  short 
game) 

THEOREM  5:  For  ant  matroid  M  and  any  sub¬ 
sets  N  and  K  of  elements  in  M,  thrrr  rust  at  mant¬ 
as  k  disjoint  subsets  of  N  which  j pan  rach  other 
and  which  span  K.  if  ant  unit  if  thrrr  is  na  con¬ 
traction  matroid  \f  of  M  where  NONL  partitions  into 
at  few  aj  k  sets  such  that  rach  is  independent  in  M‘ 
and  such  that  at  least  one  of  them  does  not  span  K  D  M' 

PROOF:  The  “only  if  part  of  theorem  .*>  follows  from 
the  "if  part  of  prop.  7  Suppose  in  matroid  A 1  there 
exist  k  disjoin'  subsets  T,  of  \C  Af.  which  span  each 
other  and  which  span  KC  Af.  Let  Af  lie  any  contrac¬ 
tion  of  W.  Where  a  set  T,  is  the  7"  ol  prop.  7;  where 
set  Af  is  the  .4  and  matroid  .AT  is  the  Af  x  4  of  prop  7; 
and  where  .S',  the  span  iclosurc)  in  A/  of  each  Ti,  is  the 
K  of  prop.  7;  prop  7  say»  that  TlKT,r\.hf‘  spans 
.S' •SOW'  in  matrud  if.  Since  each  77  spana  S', 
each  77  contain*  at  least  rtS’)  elemrr.u  where  rtS’)  is 
the  rank  of  set  S'  in  matroid  Af .  Since  all  the  sets  T", 
are  mutually  disjoint.  YDS'  contains  at  least  k  ■  nS'i 
elements. 

On  the  other  hand,  suppose  A  "  A f '  partitions  into 
as  few  as  k  independent  srts  /,  AT  where  one  of 


them  l\  does  not  span  ADA/'  and  hence  dues  n* >t 
span  S'.  Since  each  /'=/, DS'  is  independent, 
each  /’  contains  at  most  rtS'l  elements.  Since  /’ 
doe?  not  span  S',  it  contains  fewer  than  r(S  )  elements 
Therefore  .VnS'=U/'  contains  lewer  than  k-nS'i 
elements.  Thus,  the  “onlv  if  part  of  theorem 
is  proved. 

The  “if  part  of  theorem  3  follows  fmm  propositions 
H  and  9  and  theorem  1  Let  Af  he  any  matroid.  let 
Y  and  A  be  any  subsets  of  A/,  and  let  k  be  any  positive 
integer.  Suppose  4n  is  a  maximal  subset  of  V  sueh 
that  !  4oi  —  k  -  rl  -f„i  arid  d  'S  A  ■  n.4)  for  all  d  C.d„ 
Set  d0  may  he  emptv  By  theorem  I.  in  partition* 
into  k  independent  sets,  /,.  Since  *■  k  •  rt.4#i. 
each  /;  must  he  a  base  ot  submatroid  da  and  of  course 
also  a  base  of  Sl.-f»t.  the  span  of  da  in  If. 

Let  A/'  be  the  contraction  matroid  ot  \l  obtained  by 
contracting  .Sj.4„>  in  A/.  Suppose  di  is  a  -iibset  ol 
-  Yn  A/'  such  that  |.4ti  =  A  •  n.4|)  and  4!  *  k  ■  ndi 
in  matroid  A/'  for  all  dCd Then  like  do  in  A/. 
4i  partitions  into  k  bases  /'  of  submatroid  4 ,  of  A/’. 
By  j  rop  8,  submatroid  -ft  of  A f  is  the  contraction 
to  d  i  of  the  submatroid  4iUSl,49iof  Af.  Call  it  minor 
.-ft.  By  prop  9,  a  base  of  minor  A ,  together  with  a 
base  of  submatroid  S(do>  of  Af  is  a  base  of  submatioi  ( 
,4iUS(.4«)  of  Af 

In  particular,  by  pairing  tl.e  sets  /'  one-to-one  with 
the  sets  /,,  we  get  k  disjoint  bases  rt  =  l\Ul,  of  sub 
matroid  •fiUSl.-f,,)  of  M  Since  Uf* ^ .-f»U  ft C  A 
and  since  the  »ets  ft  spar*  each  other  in  A/,f.4*iJ.4il 
“  k  ■  rt.40U  4,1  and.  by  the  "only  if  part  of  theorem  I, 
|.4|*A-rt4i  in  Af  for  all  ,4C.4*U.4i  However, 
dr  was  taken  to  be  maximal  for  this  property ,  and  hence 
,4t  is  empty.  Thus,  matroid  Af '  contains  no  nonempty 
.4-  ,  as  dehned. 

Since  S<d»>  is  closed  in  Af,  the  matroid  AC,  obtained 
by  contracting  St,4»).  contains  no  element  of  rank 
lero  h  orollarv  to  prop.  At.  Suppose  Y'*.Yn.Af 
conla-ns  a  nonempty  set  4:  such  that  |.4i| »  Avi,4j) 
in  Af’.  Take  i,  to  be  minimal.  Bv  the  nonexistence 
in  Af  of  a  nonempty  d,  as  described  above,  we  have 
that  f;  'k  ri4j)  Since  there  arr  no  elements  o( 
rank  arm,  .4:  contains  at  least  two  element*  Deleting 
an  element  from  4S  to  get  a  nonempty  .4:,.  we  have 
!  4,1  *  k  ■  rt.4jt  »  A  •  rt.4|i  in  Af  ’,  which  contradicts  the 
minimality  of  4;.  Therefore,  (or  all  nonempty  subsets 
4  o|  A  ‘ ,i  4! <'  A  ■  rt.4)  in  Af  \ 

Suppose  some  element  g«A  is  contained  in  matroid 
Af’.  Smre  ft  iloes  not  hav-’  xrro  rank,  there  rusts  « 
matron!  Af*.  which  contain*  the  element*  of  Af 
plu»  a  new  auxiliary  element  h,  such  that  h  and  t 
form  a  circuit  in  Af*  and  such  that  submatroid  Af,  —  A 
of  Af*  i*  the  matroid  .If  It  is  easy  to  verify  that 
Af*  i*  «uch  a  matroid  where  the  circuits  of  W *.  are 
fli  the  set  consisting  of  f  and  h,  (2)  the  circuits  ot 
Af.  and  t3i  sets  (C-giuA  where  C  is  a  circuit  of 
Af  which  contains  g.  Let  Y»**>.Y‘U h.  it  iollows 
Ir-.im  the  relation!  4  <-  A  ■  n  A)  in  maimid  Af  for  all 
nonempty  4C.Y'.  that  4»'  w  k- rt.4*)  in  Af*  for  ali 
f*  in  A* 


78 


Hence  by  theorem  1,  /V*  can  be  partitioned  into  * 
independent  sets  !)  of  A#*,  including  the  set,  say  /*, 
which  contisns  A.  In  matroid  M'  the  set  /*  — A  is 
independent  and  doe*  not  span  g.  All  of  the  other 
/*  su-p  independent  in  A/'.  These  s t  /*  and  /* 

-A  arc  a  partition  rf  A”. 


Thus,  if  there  is  no  such  partition  of  N,;mKn\!‘ 
for  contraction  Af'  of  Af  then  no  element  of  K  it-  in 
A/'.  Thus  KCS{At}.  In  this  cate,  the  k  bases  /■ 
of  submatroid  SiA„)  of  A/  span  each  other  and  span 
K  in  Af.  This  completes  the  proof  of  theorem  > 

(Paper  69B1  !  .!'•) 
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In  section  1,  transversal  matroids  arc  associated  with  “systems  oi  distinct  representatives”  (i.e., 
transversals!  and,  more  generally,  matching  matroids  arc  associated  with  matchings  in  graphs.  The 
transversal  matroids  and  a  theorem  of  P.  J.  Higgins  on  disjoint  transversals  of  a  family  of  sets,  along 
with  the  well-known  graphic  matroids  and  some  theorem*  on  decomposition  of  graphs  into  forests, 
motivate  some  theorems  on  partitions  of  general  matroids  into  independent  sets.  In  section  2,  the 
relationship  between  transversal  result  and  matroid  Jesuit  is  illustrated  for  a  special  case  oi  later 
theorems.  In  section  3.  theorems  on  transversals  are  proved  using  network  flows.  In  sections  4  and 
5,  theorems  on  matroids  are  presented  which  imply  various  results  on  decomposition  into  transversals 
or  into  forests.  In  section  f,«  the  matching  matroid*.  arc  shown  to  he  simply  the  transversal  matroids. 


For  the  most  part,  sections  2,  3.  4-  5,  ami  f>  can  be 

1.  Transversal  Matroids 

A  matriod  M  —  (E ,  F)  is  a  finite  set  E  of  elements  and 
a  family  F  of  subsets  of  E,  called  independent  sets, 
such  that  (1)  every  subset  of  an  independent  set  is 
independent;  and  (2)  for  every  set  A  C  E,  all  maximal 
independent  subsets  of  A  have  the  same  cardinahty, 
called  the  rank  r{A)  of  A. 

Sometimes  no  explicit  distinction  is  made  between 
a  matroid  and  its  set  of  elements,  in  the  same  way 
that  no  explicit  distinction  is  made  between  groups, 
spaces,  or  graphs  and  their  sets  of  members.  For 
example,  one  normally  uses  the  same  symbol  to  denote 
a  space  and  the  set  of  points  in  a  space.  On  the 
other  hand,  it  is  often  desirable  to  consider  various 
matroids  that  have  the  same  set  of  elements. 

The  primary  example  of  a  matroid  is  obtained  by 
letting  E  be  the  set  of  columns  in  a  matrix  over  some 
field  and  F  the  family  of  linearly  independent  subsets 
of  columns.  In  particular,  E  may  be  the  set  of  edges 
in  a  graph  and  F  the  family  of  edge-sets  that  comprise 
“forests’’  in  the  graph.  A  matroid  that  is  abstractly 
isomorphic  to  one  of  the  latter  kind  is  called  graphic. 

Our  motivation  here  will  he  another  source  of 
matroids,  which  is  an  extensive  theory  in  its  own 
right.  It  is  well  known  in  various  contexts,  including 
systems  of  distinct  representatives,  (0.  I  (-matrices, 
network  flows,  matchings  in  graphs,  marriages,  and 
so  forth  (see  [3]).'  Here  we  will  refer  to  it  very 
broadly  as  transversal  theory. 
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Let  Q={qf,  i=  1 . m }  be  a  family  of  (not 

necessarily  distinct)  subsets  of  a  set  E={ey,  j—l, 

.  .  .  ,  n}.  The  set  r={exi) . ego), 

is  called  a  partial  transversal  (of  size  t)  of  Q  if  T  con¬ 
sists  of  distinct  elements  in  E  and  if  there  are  distinct 
integers  ill),  .  .  .  ,  i(t)  such  that  e;(*> eqxk)  for  k—  1, 
.  .  .  .  t.  The  set  T  is  called  a  transversal  or  a  system 
of  distinct  representatives  of  Q  if  t  =  m. 

THEOREM.  Let  Q  be  any  finite  family  of  ( not  neces¬ 
sarily  distinct )  subsets  of  a  finite  set  E.  (a)  If  F  is 
the  family  of  partial  transversals  of  Q,  then  M*  =  (E,  F) 
is  a  matroid.  (b)  If  F  is  the  collection  of  subfamilies 
of  Q  that  have  transversals ,  then  Mb  =  (Q,  F)  is  a 
matroid. 

The  statements  (a)  and  (b)  are  equivalent  and  refer 
to  the  same  abstract  class  of  matroids  because  the 
roles  of  Q  and  E  are  actually  symmetric.  The  situation 
is  easily  visualized  in  the  form  of  the  “incidence  graph" 
of  ( E ,  Q ):  a  "bipartite"  graph,  G~G(E,  Q),  where  the 
nodes  in  one  part  are  members  of  Q  and  the  nodes  in 
the  other  part  are  the  members  of  E.  The  edges  of  C, 
which  all  go  from  one  part  to  the  other,  are  the  inci¬ 
dences  between  Q  and  E. 

A  transversal  matroid  is  one  that  is  abstractly  isomor¬ 
phic  to  an  Ma  (or  an  Mb).  Matroid  theory  and  trans¬ 
versal  theory  enhance  each  other  via  transversal 
matroids,  as  do  matroid  theory  and  graph  theory  via 
graphic  matroids. 

Let  E  be  any  fixed  subset  of  nodes  in  any  given  graph 
6’.  We  assume  throughout  this  paper  that  each  edge 
of  a  graph  meets  two  distinct  nodes.  Let  subset 
T  C  E  be  a  member  of  F  when  T  meets  (is  contained 
in  the  set  of  endpoints  of)  some  matching  in  G.  (A 
matching  in  a  graph  is  a  set  of  its  edges  such  that  no 
two  members  of  the  set  meet  the  same  node.)  We 
shall  show  that  Ma.s  =  (E,  F)  is  a  matroid  by  verifying 
axiom  (2).  In  general,  where  G  is  not  necessarily 
bipartite  and  where  E  is  any  subset  of  nodes,  we  call 
M<,.  t.  a  matching  matroid.  For  any  A  C  E,  let  T,  and 
Tt  be  maximal  subsets  of  A  which  meet  matchings, 
say  /Vi  and  /Vs,  respectively.  Consider  the  subgraph 
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.V  C  G  formed  by  the  edge-set 

A',  +  Nt  =  lN,-Nt-  U  tNfSt) 

and  the  endpoints  of  its  members.  The  connected 
components  of  V  are  simple  open  and  closed  paths 
because  each  node  of  A  meets  either  one  or  two  edges 
of  ,V.  Set 

Tt  +  T:  =  \T\  —  7*)  U  ( n  —  r , > 

consists  precisely  of  the  path-ends  of  ,V  that  are  in  A\ 

( T\  —  Ti )  are  the  nodes  of  A  that  meet  A\  but  not  A>. 
and  are  the  nodes  of  A  that  meet  N->  but  not 

Nt.  Suppose  T>  is  larger  than  Tt:  then  Ty-T\  is 
larger  than  T\  —  T>.  In  this  case,  some  component  of 
N  must  be  an  open  path,  say  P,  which  lias  one  end  v 
in  Tt~T\  and  the  other  end  not  in  T,  —  T<.  Regarding 
path  P  as  its  edge-set.  N\  +  P  =  (Aj  —  P)  U  (P  —  N\t 
is  a  matching.  This  matching  meets  Tt  in  A  and  in 
addition  it  meets  v  in  A.  Thus,  we  contradict  the 
hypothesis  that  Tt  is  a  maximal  subset  of  A  which 
meets  a  matching.  Therefore,  Tt  and  Ti  have  the 
same  cardinality  and  it  follows  that  =  {F-.  P)  is  a 
matroid. 

General  matching  matroids  are  discussed  in  sec¬ 
tion  6. 

2.  Introduction 

P.  J.  Higgins  [4]  gives  conditions  for  a  family  Q  of 
sets  to  have  k  mutually  disjoint  partial  transversals 
of  prescribed  sizes  nt,  nt,  .  ,  ti*.  In  section  4  we 
present  conditions  for  a  matroid  M  to  have  k  mutually 
disjoint  independent  sets  of  prescribed  sizes  n ■, 
nt . rik.  Where  the  matroid  is  graphic,  for  ex¬ 

ample,  this  result  is  new. 

The  following  two  closely  related  matroid  theorems 
are  presented  in  (1]  and  [2]  as  generalizations  of  the¬ 
orems  by  Nash-Williams  and  Tutte  on  graphs.  The¬ 
orem  2,  below,  for  the  case  of  transversals,  handles 
a  special  case  of  the  Higgins  problem;  it  will  be  gen¬ 
eralized  to  cover  the  Higgins  problem.  Theorem  1  is 
new  for  the  case  of  transversals;  it  will  be  generalized 
analogously. 

Theorem  1.  The  elements  E  of  a  matroid  M  can 
be  partitioned  into  as  feu  as  k  sets,  each  independent 
in  M,  if  and  only  // 1 A  | «  k  -  r<  A)  for  all  A  C  E. 

THEOREM  2.  The  elements  E  of  a  matroid  M  can 
be  partitioned  into  as  many  as  k  sets,  each  a  spanning 
set  of  M,  if  and  only  if  |  A|  2*  k(r(E)  —  r(X))  for  all 
ACE. 

As  usual  \A\  denotes  cardinality  of  set  A,  and  A 
denotes  the  complement  of  A  (with  respect  to  E). 
A  spanning  set  of  a  matroid  M  is  a  subset  of  E  which 
contains  a  maximal  independent  set. 

A  base  of  a  matroid  M  is  a  maximal  independent 
set,  i.e.,  a  minimal  spanning  set.  Each  base  has 
cardinality  equal  to  r\E),  the  rank  of  the  matroid. 

For  any  family  B  of  subsets  of  a  set  E,  a  covering 
in  B  is  a  subfamily  whose  union  is  E,  and  a  packing 
in  B  is  a  subfamily  whose  members  are  disjoint. 


Where  B  is  the  family  of  bases  of  matroid  M,  theorem  1 
describes  the  minimum  cardinality  of  a  covering  in 
B,  and  theorem  2  describes  the  maximum  cardinality 
of  a  packing  in  B. 

Applied  to  a  transversal  matroid  Ms,  where  the 
members  of  a  family  Q  are  the  matroid  elements  and 
where  the  subfamilies  that  have  transversals  are  the 
independent  sets  of  elements.  Theorem  1  says  that 
a  family  Q  of  sets  can  be  partitioned  into  as  few  as 
k  subfamilies,  each  having  a  transversal,  if  and  only 
if  |  A  |  ^  k  •  pi  A )  for  every  subfamily  A  C  Q.  Here 
p(A)  denotes  the  maximum  cardinality  of  a  subfamily 
of  A  which  has  a  transversal,  i.e.,  the  maximum  car¬ 
dinality  of  a  partial  transversal  of  A.  The  statement 
is  not  interesting  when  k—  1;  for  abstract  matroids 
there  is  nothing  interesting  to  say  in  this  case. 

Where  A  is  a  family  of  subsets  of  a  set  E,  where 
NlA’,  A)  is  ihe  (0.  1|  incidence  matrix  of  members  of 
E  (rows)  versus  members  of  A  (columns),  and  where 
G{E,  At  is  the  bipartite  incidence  graph  of  (E,  A),  the 
value  pi. 4)  is  called  the  term  rank  of  A,  A 1{E,  A),  and 
Gl E,  A),  respectively.  One  of  the  two  fundamental 
forms  of  the  fundamental  theorem  of  transversal  theory 
is  due  to  P.  Hall.  It  describes  when  a  family  A  (or  Q) 
itself  has  a  transversal.  The  other  fundamental  form 
of  the  fundamental  theorem  is  Kbnig’s  formula  for 
term  rank;  plAt,  the  maximum  cardinality  of  a  partial 
transversal  of  A  or  of  a  matching  in  G(E,  A)  (i.e., 
a  set  of  l's  which  might  be  called  a  matching  in 
,Y(£.  /!))  is  equal  to  the  minimum  cardinality  of  a  set 
of  nodes  that  meets  all  edges  in  G(E,  A)  (i.e..  a  set  of 
rows  and  columns  that  together  contain  all  l’s  of 

ME,  A)). 

Let  cr(A),  for  A  C  Q,  denote  the  cardinality  of  the 
union  of  the  members  of  A.  It  is  a  consequence  of 
the  Kiinig  formula  for  term  rank  that  the  inequalities 
|,4 1  A-pb4)  for  all  A  CQ  are  equivalent  to  the  in¬ 
equalities  l  A  j «  k-aiA)  for  all  A  C  (?.  Thus  the  latter 
are  also  necessary  and  sufficient  for  Q  to  have  a  parti¬ 
tion  into  k  subfamilies,  each  with  a  transversal. 
When  Ar  =  1 ,  this  is  P.  Hall’s  theorem  on  transversals. 

To  see  this  equivalence,  suppose  that  \A\>  k'p(A) 
for  some  A  C  Q.  In  the  incidence  graph  (A E,  A), 
let  Ei  U  A i,  E\  C  E  and  At  C  A,  be  a  minimum  car¬ 
dinality  set  of  nodes  that  meets  all  of  the  edges. 
By  the  Kiinig  theorem,  piA)~  |£(  |  +  Mi  |.  Let 
At  =  A—A i.  The  set-union  of  members  of  At,  that 
is,  the  other  ends  of  all  the  edges  that  meet  At,  is  E i, 
so  (HAi)  =  jEi  |.  Combining,  we  have 

\At\  =  \A\-\A,\  >Ar<|£,|  +  M,|)-  \A,\ 

=  A-|£i  |  +  (*  —  l)-|/4i  |  »  k-aiAt). 

On  the  other  hand,  clearly  p{A)  <  ofA)  for  all  A  C  Q. 
Therefore,  j  A  |  «  k  -pi  A)  for  all  A  C  Q  is  equivalent 
to  j  A  j  «  k  •  olA)  for  all  A  C  Q.  Thus,  Q  can  be  par¬ 
titioned  into  as  few  as  k  subfamilies,  each  with  a  trans¬ 
versal,  if  and  only  if  the  latter  holds. 

We  do  not  recommend  this  matroid  approach  as  the 
way  to  derive  the  transversal  result.  Theorem  1  in 
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general  i>  not  ea»>,  and,  even  after  it  is  established, 
using  it  with  t lie  Ktirug  theorem  !■>  get  the  transversal 
result  is  no  easier  than  deriving  the  transversal 
result  directly  from  P.  Hall's  theorem  as  follows. 
Let  each  element  e  t  E  be  replicated  k  times  to  obtain 
et,  .  .  .  ,  ekfE'.  To  obtain  Q\  let  q1  eQ'  consist  of 
all  the  replications  of  the  elements  in  qtQ.  Then 
\A\  « k  ■  <r(A)  for  all  ACQ  is  equivalent  to  \A'\ 
«c r(A')  for  all  A'  CQ'.  By  P.  Hall's  theorem  the 
latter  is  equivalent  to  the  existence  of  a  transversal 
for  Q'.  That,  in  turn,  is  equivalent  to  there  being  a 
partition  of  Q  into  as  few  as  k  subfamilies,  each  having 
a  transversal. 

Section  3  presents  a  derivation  of  transversal 
theorems  using  network  flows.  Section  4  presents 
a  different  derivation  of  the  corresponding  matroid 
theorems.  Both  derivations  suggest  computationally 
good  algorithms.  Section  5  presents  another  ap¬ 
plication  of  section  4,  and  section  6  relates  general 
matching  matroids  to  transversal  matroids. 


3.  Transversal  Covers  and  Packings 

In  this  section  we  focus  attention  on  the  transversal 
matroid  —  F),  F  being  the  family  of  partial  trans¬ 

versals  of  Q.  We  shall  use  network  flows  to  derive 
results  on  covers  and  packings  in  F.  For  background 
material  on  network  flows,  we  refer  to  [3].  In  partic¬ 
ular,  the  max-flow  min-cut  theorem  and  integrity 
theorem  will  be  applied.3 

Consider  the  directed  network  shown  in  figure  1. 
In  figure  1  we  have,  in  addition  to  a  source-node  u  and 
a  sink-node  v,  three  tiers  of  nodes:  ei,  e2,  .  .  .  ,  en 
(elements  of  E);  qt,  qi, .  .  .  ,  qm  (subsets  off  that  com¬ 
prise  the  family  Q);  and  pi,  pi,  .  .  .  ,  p*  (partial  trans¬ 
versals).  The  directed  edges  of  this  network  and  their 
flow  capacities  are  listed  below: 


Edges 

Capacities 

(U. 

;-! . n, 

(iu,  et)  =  1, 

(ej,  9i>, 

<.  rrpupondinn  t<>  e,  <  qi. 

cie,.  fil  ». 

(?l.  Pr), 

i  “  1 . m:  r  —  1 . k. 

ciq,,  p ,*■“!, 

(Pr,  V*. 

r-  I . *. 

<lp„  i )  =  n,. 

An  integral  flow  from  source  to  sink  in  this  network 
produces  A  mutually  disjoint  partial  transversals  of 
respective  sizes  si  *  ni,  s2  <  n *,  .  .  .  ,  s*  s  n*  in  the 
following  manner.  Take  a  chain  decomposition  of 

the  flow  and  put  tj  in  pr  if,  for  some  *  =  1,2 . in, 

the  edges  (ej,  qi)  and  {q>,  pA  occur  in  a  chain  of  this 
decomposition.  Conversely,  A  mutually  disjoint  partial 

transversals  of  sizes  is  «  n2 . s*  <  n* 

yield  an  integral  flow  from  source  to  sink.  Using  the 


1  In  ■  graph  where  the  rdpra  t.  ire  directed  and  have  p<.»im*  integer  rapacities  ,■ , 
the  maitmum  number  <>(  chains  (directed  paths,  not  necessanU  distinct*  Imm  a  node  u 
hi  a  node  v,  such  that  each  e,  t*  contained  in  at  most « .  o(  these  chains,  equals  the  mir.imum 
id  the  total  capacity  of  the  edge*  directed  Irom  l  in  t  where  if  .  ?  )  ta  ant  partition  of  all 
tha  node*  into  two  parts  aurh  that  a  #  f  ami  1 1  C  The  family  <d  chains  is  railed  a  i-tum 
deeotnpuaition  of  a  minimum  flow  from  w  to  i  The  set  <d  edges  directed  Imm  a  f  to  r  is 
called  a  cut  aeparaling  source  m  from  sink  < 


Ficire  1. 


integrity  theorem  and  max-flow  min-cut  theorem  lor 
network  flows,  it  follows  that  the  maximum  number  of 
elements  contained  in  a  union  of  k  (mutually  disjoint  l 
partial  transversals  of  respective  sizes  Si  «  ni,  is  ^  nt, 
,  .  .  ,  Sk ■ «  n*  is  equal  to  the  capacity  of  a  minimum 
cut  separating  source  and  sink  in  this  network.  We 
proceed  to  calculate  this. 

Let  A,  B,  C  be  arbitrary  subsets  of  E=  {e>,  e2, .  .  ., 

«»},  Q={qu  q2,  ,  qm },  and  P={pi,  ps . 

p*},  respectively,  and  denote  their  complements  in 
these  sets  by  A,  S,  C.  The  capacity  of  an  arbitrary 
cut  separating  u  and  v  is  then  represented  by  the  sum 


We  wish  to  minimize  this  over  A  C  E,  B  C  Q,  C  C  P. 
Using  the  table  of  edge  capacities,  this  reduces  to 
computing  the  minimum  of 

|d|+l*HCi+2> 

MC 

over  A  C  E,  B  C  Q.C  C  P  such  that  the  set  of  edges 
leading  from  A  to  B  is  empty.  Thus,  for  given  A  and 
C,  we  may  take  B  to  consist  solely  of  those  nodes  of  Q 
which  are  joined  by  edges  to  some  node  of  A.  In  the 
language  of  set  representatives,  B  consists  of  those  ^et* 
represented  by  elements  of  A.  Moreover,  for  C  of 
fixed  cardinality  |C|  =  A  — s,  we  may  take  C  to  corre 
spond  to  the  s  smallest  n  s.  Thus,  choosing  the  nota¬ 
tion  so  that  0  <  ni  <  m  <  .  .  .  <  n*,  and  letting  <x(d) 
denote  the  cardinality  of  the  subfamily  of  Q  represented 
by  elements  of  A,  we  are  led  to  minimizing 

\1\  +  (A-  j)<rt4)+  J)  n' 

r«l 

over  ACE  and  5  =  0,  1,  .  .  .  ,  k.  For  fixed  A,  the 
minimization  over  s  can  be  carried  out  explicitly. 
Indeed,  let  n*  be  the  number  of  integers  among  the  nr, 

r=l,  2,  .  .  .  ,  k,  such  that  nr*j,  j-  1,  2 . 

Thus  (n/j  and  (nr)  are  conjugate  partitions  of  the  integer 
"r-  ^  •*  not  hard  to  see,  especially  in  terms  of  a 
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partition  diagram,  that 


4.  Matroid  Partition 


[I  I  oUl 

(k  —  jVtM)  4-  n'  =  2)  nJ' 

r- l-l  J”  l 

This  proves  that  the  mrximum  number  of  elements  of 
E  contained  in  a  union  of  k  ( mutually  disjoint)  partial 
transversals  of  Q,  having  respective  sizes  si  « m, 
s2  «  02,  •  •  •  ,  s„  *£  n*,  is  equal  to 

_  cKA) 

(*)  min  [|A|+  2  n;]. 

•' c  i-/ 


Here  <r(A)  denotes  the  number  of  sets  in  the  family  Q 
that  are  represented  by  elements  of  A. 

The  following  two  theorems,  which  give  necessary 
and  sufficient  conditions  for  the  existence  of  covers 
and  packings  composed  of  partial  transversals  of  pre¬ 
scribed  sizes,  are  consequences  of  this  result.  (Nash- 
Williams  originated  a  similar  viewpoint  for  related 
theorems  on  matroids.) 

Theorem  la.  Let  Q  be  a  finite  family  of  subsets  of 
a  finite  set  E.  The  family  Q  has  k  partial  transversals 
of  respective  sizes  n(,  n2,  .  .  .  ,  nK  whose  union  is  E  if 

and  only  if  (i)  n,  *£  p(E),  i  =  1,  2 . k,  and  (it)  for 

every  A  C  E,  the  inequality 

<r(AI 

|A|  *  £  n; 

t-i 

holds. 

Here  p(E)  denotes  the  term  rank  of  the  bipartite 
incidence  graph  (or  matrix)  of  elements  of  E  versus 
sets  of  the  family  Q,  that  is,  p{F)  is  the  rank  of  the 
matroid  Ma  —  [E,  F).  The  proof  of  sufficiency  of  (i) 
and  (ii)  makes  use  of  the  fact  that  Ma  is  a  matroid  in 
extending  the  k  partial  transversals  of  sizes  Si  to 
partial  transversals  of  sizes  nt,  i  —  1,  2,  .  .  .  ,  k. 

Theorem  2a.  Let  Q  be  a  finite  family  of  subsets  of 
a  finite  set  E.  The  family  Q  has  k  mutually  disjoint 
partial  transversals  of  respective  sizes  ni,  n 
nk  if  and  only  if,  for  every  A  C  E,  the  inequality 


holds. 

Using  the  Kiinig  theorem  in  an  argument  similar  to 
that  in  section  2  shows  that  the  rank  function  p(A)  of 
matroid  M,  can  be  used  in  place  of  <riA)  in  (*),  hence 
also  in  theorems  la  and  2a. 

The  situation  of  theorem  2a  is  the  problem  studied 
by  Higgins.  His  conditions  are  not  the  same  as  those 
of  theorem  2a,  but  are  instead  stated  in  terms  of  sub¬ 
families  B  of  Q  rather  than  subsets  A  of  E.  They  may 
be  derived  from  theorem  2a  by  use  of  the  Kiinig  theo¬ 
rem  (and  vice  versa),  or  can  be  obtained  directly  by 
eliminating  A  and  C,  rather  than  B  and  C,  in  the  mini¬ 
mization  argument  leading  to  (*). 


Theorem  lb.  The  set  E  of  elements  of  a  matroid 
M  can  be  covered  by  a  family  of  independent  subsets 
It  (i  =  1 ,  .  .  .  ,  k)  of  prescribed  sizes  n,  <  rfE)  if  and 
only  if,  for  every  A  C  E, 

Mi  «  V  min  («(,  rfd)). 

J-i  * 

Theorem  2b.  The  set  E  of  elements  of  a  matroid 
M  contains  mutually  disjoint  independent  subsets 
Ii(i  =  I. .  .  .  ,  k)  of  prescribed  sizu  n,  *£  r(E)  if  and  only 
ijf,  for  every  A  C  E, 

}A I  ss  y  n*  -  min  (n,,  HA))]. 

j- i?T)+i  ' 

Here  ti.A )  denotes  rank  relative  to  matroid  M.  The 
equations  in  theorems  lb  and  2b  aie  obvious. 

Using  lemma  1,  theorems  lb  and  2b  follow  immedi¬ 
ately  from  theorems  lc  and  2c  below. 

Lemma  1.  F or  any  matroid  M  =  (E,F)  and  any  non 
negative  integer  n,  let  F(„)  denote  the  members  of  F 
which  have  cardinality  at  most  n.  Then  M<„'  **  (E,  F-  ) 
is  a  matroid.  Where  r(A )  is  the  rank  function  for  M, 
the  rank  function  for  M(n)  is 

fw(A)m  min  (n,  riA)). 

We  call  M(n>  the  truncation  of  i M  at  n. 

The  proof  of  lemma  1  is  obvious. 

Let  ri(A)  be  the  rank  functions  for  any  family  of 
matroids  Mi**tE,  Ft),  »»1,  .  .  .  ,  A,  on  the  set  E  of 
elements. 

THEOREM  lc.  Set  E  can  be  partitioned  into  a  family 
of  subsets  I|(i  =  l,  .  .  .  ,  k),  where  I|<  F(,  if  and  only 
if  for  every  A  C  E, 

i 

Theorem  2c.  There  is  a  family  of  mutually  dis¬ 
joint  sets  I|(i  =  1 ,  .  .  .  ,  k),  where  L  is  a  maximal  mem¬ 
ber  (base)  in  Ft,  if  and  only  if  for  all  A  C  E, 

\A\  >  £  *£)-£  n(l). 

Where  each  Mt  is  a  graph,  theorem  2c  is  equivalent 
to  a  theorem  of  Tutte  [5]. 

Since  it  can  be  shown  that  a  truncation  of  a  graphic 
or  a  transversal  matroid  is  not  necessarily  graphic 
or  transversal,  theorems  lb  and  2b  for  these  cases  do 
not  follow  from  theorems  lc  and  2c  for  these  cases  as 
in  general.  A  similar  remark  applies  to  the  way  2c 
will  be  derived  from  lc.  Thus  we  observe  that  the 
general  matroid  concept  is  useful  even  where  primary 
interest  is  more  special.  The  proof  of  lc,  on  the  other 
hand,  is  arranged  so  that  the  only  matroids  it  will 
mention  are  those  of  the  theorem.  Hence,  the  proof 
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applies  directly  to  apy  special  class  of  matroids  (in¬ 
cluding  classes  of  one).  Everything  in  references  [1] 
and  (2)  applies  directly  to  the  case  of  only  graphs. 
The  proofs  in  [2]  do  not  apply  directly  to  the  case  of 
only  transversals  because,  as  will  be  shown  at  another 
time,  a  “contraction"  of  a  transversal  matroid  is  not 
necessarily  transversal. 

LEMMA  2.  Let  A  be  any  subset  of  the  elements  of  a 
matroid  M.  Let  I  be  any  independent  subset  of  A. 
A  maximal  set  S,  such  that  I  C  S  C  A  and  r(S)  =  r(I) 
=-|l|,  is  the  unique  set  consisting  of  1  and  etements 
e  e  A  such  that  e  U  I  is  dependent. 

Set  S  is  called  ihe  span  of  1  in  A. 

PROOF.  Consider  eeA  —  l.  By  the  definition  of 
rank,  /  is  a  maximal  independent  subset  of  any  S. 
Thus,  if  e  U  /  is  independent,  then  e^S.  And  thus, 
on  the  other  hand,  if  e  U  /  is  dependent,  then  /  is  a 
maximal  independent  subset  of  e  U  .S.  Hence  by 
axiom  2  for  matroids,  r(e  U  S)~\l\,  and  so  etS. 

LEMMA  3.  The  union  of  any  independent  set  1  and 
any  element  e  of  a  matroid  M  contain. v  at  most  one 
minimal  dependent  set. 

A  minimal  dependent  set  is  called  a  circuit  of  M. 

PROOF.  Suppose  /  U  e  contains  two  distinct  cir¬ 
cuits  Ci  and  Ct.  Assume  I  is  minimal  for  this  possi¬ 
bility.  We  have  e«Ci  D  Cj.  There  is  an  element 
Ci«Ci  —  Ct  and  an  element  e2fC>-C i.  Set  (/  U  e) 

—  (ei  U  e»)  is  independent  since  otherwise  /  — e<  is  a 
smaller  independent  set  than  /  for  which  l l  —  e,)  U  e 
contains  more  than  one  circuit.  Set  /  and  set  </  U  e) 

—  (eiUej)  are  maximal  independent  subsets  of  set 
/  U  e.  This  contradicts  axiom  2. 

Proof  of  lc.  Suppose  that  {/< }  (t  —  1 . A)  is 

a  partition  of  E,  where  IttFi.  Then  for  arbitrary 
ACE, 

\A  n  /,( -  £  rAA  n  L)  *2  r, (A). 

Conversely,  suppose  that  for  every  A  C  E,  the  in¬ 
equality  holds.  Let  {/(}  (*  =  1 ,  .  .  .  ,  k)  be  a  family 
of  disjoint  sets  such  that  It  is  independent  in  M>.  Any 
number  of  these  may  be  empty.  Suppose  there  is  an 


et  E~  U L. 


We  shall  show  how  to  rearrange  elements  among  the 
sets  It  to  make  room  for  e  in  one  of  them  while  ore- 
serving  the  mutual  disjointness  and  the  independence 
of  L  in  Mt.  This  will  prove  the  theorem. 

If  etS  for  any  S  C  E,  then  for  some  f ,  \  h  O  ,S  |  <  n<S ). 
<  Hherwise, 

|5|  »  |  U  (I,  n  S)  U  e| 

m  l-f  y  |/i  n  .s  |  >  ^  ritS'i 

t  i 


Let  So  =  E.  Inductively,  starting  with  j— 1  =  0, 
if  eeS).  i  then  for  some  /«j)  such  that 

|/«p  n  Sj  -  1 1  <  riy)(Sj  -  o, 

we  define  Sj  to  be  the  span  in  Sj  _  t,  with  respect  to 
matroid  Muj\,  of  I«_n  O  S)  -  t.  Since 

tiyiSj)  <  r«j>lSj  _ ,), 

Sj  is  a  proper  subset  of  Sj  -  !•  Therefore  we  must 
eventually  reach  an  Ss  such  that  efSn  and  etSj  for 
0  **j<h. 

(Where  the  matroids  Mi  are  identical,  the  construc¬ 
tion  above  is  the  same  as  the  corresponding  part  of 
the  proof  of  theorem  1  in  [1].  The  rest  of  reference : 
iH  and  |2]  goes  through  essentially  unchanged  for  :/ 
version,  concerning  possibly  distinct  matroids,  which 
includes  theorems  lc  and  2c.  However,  we  continu. 
here  with  a  substantially  trimmed  version.) 

If  eU /,(/n  is  independent  in  Mnei.  the  present 
proof  is  finished.  Otherwise  e  U  /«»)  contains  a  cir¬ 
cuit  (.  of  Mithi-  Set  (e  U  lnhi)  C\  Sh  -  \  is  not  dependent 
in  17/,/,,.  because  then,  by  lemma  2  and  by  the  defini¬ 
tion  of  Sh,  since  eeSh  t.  we  would  have  eeS*.  Thus 
let  m  be  the  smallest  integer,  0<m<h,  such  that 
(e  U  lnm)  H  Sm  is  independent  in  Mum.  There  is  an 
e'tC  —  S,„.  Bylemma3,e  U  /«*)~e'  is  independent  in 

Midi). 

Replacing  7«/,i  by  e  U  /«»>  — e',  we  now  need  to 
dispose  of  e‘  instead  of  e  However,  rve  can  show  that 

sequence  (/an.  ,S'i) . S*J,  with  the  roles  of 

e  and  e'  interchanged,  is  of  the  same  construction  as 
t/idi,  .S’,) . (/,(/,)  Ss),  only  shorter.  Since  the  origi¬ 

nal  e  U  (/jiM  ft  Sj  t)  is  dependent  in  Mu u,  for  all 
j,  1  %  ;  m,  by  lemma  3  we  have  e'tC  C  Sj  -  ■.  Con¬ 
sider  the  terms  t/ipi,  Sj),  1  « ;  =s  m,  one  after  another 
in  order.  Assume  there  is  no  change  in  Sj  .  i.  If 
originally  /ipi  A  him,  then  there  is  no  change  at  all  in 
(/«,,.  Sj).  If  originally  /<<>>  =  /«/-),  then,  even  though 
e  and  e'  are  interchanged  in  /iy>,  by  lemma  2  and  the 
definition  of  Sj,  since  e  U  e'  C  C  C  Sj  - 1,  there  is  no 
change  in  Sj.  Thus  the  theorem  is  proved. 

Proof  of  2c.  For  any  family  of  matroids,  Mt 

~(E,  F, )  (f  =  1 . k),  with  rank  functions  n(A), 

consider  the  additional  matroid  Me  =  {E,  Fo )  where 
the  members  of  F0  are  the  subsets  of  E  that  have  car¬ 
dinality  at  most  |  E  |  -  Vri(A’).  Matroid  Me  is  a  trun¬ 
cation  of  the  matroid  in  which  all  subsets  of  E  are 
independent.  The  existence  of  mutually  disjoint 

sets  /,(i  =  1 . i),  where  A  is  a  maximal  member 

of  F „  is  equivalent  to  the  existence  of  a  partition  of 
E  into  a  family  of  sets,  In  and//  (»=  ).  .  .  .,  k),  such  that 
letFo  and  /,*F j.  By  theorem  lc,  the  existence  of  that 
partition  is  equivalent  to  the  condition  that 

\A\  <  min  (|£| - ^ n(E),  \A\)  +  '^t^A) 


for  all  I  C  F. 


would  contradict  the  hypothesis. 
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That  condition  in  turn  is  equivalent  to 


6.  Addendum  on  Matchings 


\A\  s  \E\-Yrfi£)+'2r,{/t) 

i  < 

for  all  A  C  £.  which  is  equivalent  to 

lAl  ^Y,  n(E)  —  £ri(A) 

i  i 

for  all  A  C  £.  Thus  theorem  2c  is  proved. 

5.  Another  Application 

Let  Ji  (i=l,  .  .  .  ,  A)  be  mutually  disjoint  indepen¬ 
dent  sets  in  a  matroid  M  —  (£,  F).  Let  E'  —  E  —  ( U  J.V 

THEOREM  Id.  Set  E  can  be  partitioned  into  a  family 

of  independent  sets  l|eF  (i=f . k)  such  that 

J,  C  1,  if  and  only  if,  for  every  A  C  E\ 

|A|*S£(HA  UJd-rtJ,)). 

I 

Theorem  2d.  There  is  a  family  of  mutually  disjoint 
bases  I(  ii  =  1,  .  .  .  ,  k)  of  M  such  that  J|  C  l|  if  and 
only  if,  for  every  A  C  E\ 

jA|**£[r(E)-r((E’-A)U  J,)]. 

i 

For  any  matroid  M  —  <E,  F)  and  any  EaCE,  let 
Fa  consist  of  sets  ItF  such  that  I  C  Eo.  Then  M  •  Ea 
*  {Ea,  Fa),  obviously  a  matroid.  is  called  a  submatroid 
of  M  (obtained  from  M  by  deleting  the  elements  of 
Ea**E  —  E0).  The  rank  of  ^subset  of  Ea  is  the  same  in 
M  •  Ea  as  in  M. 

For  any  matroid  M—(E,  F)  and  any  E0  C  E,  let  J  be 
any  maximal  subset  of  Ea  —  E— Ea  which  is  a  member 
of  Fj_  In  other  words,  let  J  be  any  base  of  submatroid 
M  •  Eo.  Let  Fa  consist  of  sets  ItF  such  that  f  C  Eo 
and  such  that  J  U  ItF.  It  follows  easily  from  the 
definition  ot  matroid  that  Fa)  is  a  unique 

matroid,  called  the  contraction  of  M  to  Ea  (obtained 
from  M  by  contracting  the  elements  of  Eo).  Where 
r  and  r0  denote  the  rank  functions  for  matriods  M  and 
MX-Eo.  respectively,  we  have  for  every  A  C  Eo. 

rdA)~riA  U  Eo)~iiEo). 

Theorem  Id  follows  immediately  from  theorem  1c 

by  letting  the  Mi  of  1c  (for  1=0 . k)  be  the 

matroid  obtained  from  matroid  M  of  Id  by  contract¬ 
ing  the  elements  of  J,  and  then  deleting  all  the  other 
elements  of  E  —  E'. 

To  prove  2d  from  2c,  we  obtain  each  Mi  of  2c  from 
M  of  2d  in  the  same  way  as  above.  If,  for  some  i, 
tiE'  U  Ji)  <  HE),  then  no  base  of  M  is  contained  in 
E'  U  Ji  and  so  there  is  no  family  of  bases  h  as  described 
in  2d.  In  this  case  the  inequality  in  2d  does  not  hold 
where  A  is  the  empty  set.  Otherwise,  rtf'  U  Ji)m  rtf) 
for  each  i.  In  this  case,  if  J\  is  a  base  of  Mt,  then 
Ji  U  J'.  is  a  base  of  M.  Thus,  in  this  case,  2d  follows 
from  2c. 


An  element  of  a  matroid  M  is  called  isolated  if  it  is 
contained  in  every  base  of  M,  i.e.,  if  it  is  contained  in 
no  circuits  of  M.  Clearly,  any  number  of  isolated 
elements  can  be  ‘added’’  to  any  transversal  matroid 
Vf„.  thereby  obtaining  another  transversal  matroid. 
With  respect  to  the  graph  representation  (HE,  Q)  of 
\f„,  for  every  isolated  element  c  added  to  Ma,  simply 
add  a  node  e'  to  E  and  join  it  to  a  new  node  q  added 
to  Q. 

Several  elements  of  a  matroid  1/  are  said  to  be  in 
series  with  each  other  either  when  they  are  all  isolated, 
or  else  when  none  of  them  is  isolated  and  each  base  of 
M  contains  all  but  possibly  one  of  them. 

A  set  of  elements  is  in  series  in  matroid  M  if  and  only 
if  the  elements  are  contained  in  exactly  the  same 
circuits  of  Nl. 

Suppose  some  base  /  of  M  contains  neither  of  ele¬ 
ments  e,  and  et  of  M.  Then  /  Ur,  contains  a  circuit 
of  M  that  contains  e,  but  not  ej. 

Suppose  an  element  o  is  contained  in  a  circuit  (. 
of  M  that  does  not  contain  nonisolated  element  e-i  of 
M.  Let  /  be  a  base  of  M  which  does  not  contain  e*. 
The  rank  of  (/  UCl-ri  is  as  large  as  the  rank  of  /; 
otherwise  every  maximal  independent  subset  of  /  U  C 
would  contain  e,,  but  then  et  would  be  contained  in 
no  circuit  in  /  U  C.  Therefore  (f  UC)-(i  contains  a 
base  of  M;  this  ba  e  contains  neither  ei  nor  et.  Thus 
the  theorem  is  proved. 

“Replacing  an  element  et  in  a  matroid  M  by  a  set 

£f={e/ . ef)  of  new  elements  in  series”  yields 

u  matroid  Af>- k).  The  circuits  of  M"- k)  and  the  ele¬ 
ments  of  M[l- k>  are  identical  with  those  of  M  except  that 
e,  is  replaced  by  the  members  of  £}.  Each  base  B  of 
M  which  contains  et  corresponds  to  a  base  \B  —  ei)  U£f| 
of  M"-  Each  base  B  of  M  which  dots  not  contain  et 
corresponds  to  A  base*  of  V"  **  of  the  form  B  u  E\ 

—  e|,  j  —  1 . A.  We  omit  proof  that  AT*-**  is  a 

matroid,  which  is  not  difficult  using  the  description  of 
the  bases. 

For  any  transversal  matroid  M„  containing  element 
e(,  the  matroid  M1,'1  Kl  is  also  transversal. 

Let  Mn  lie  represented  by  a  bipartite  graph  (l 
=  QE,  (At  as  described  in  section  1;  a  base  of  ,W„ 
consists  of  the  endpoints  in  E  of  a  maximum  cardi 
naiiiy  matching  in  G.  By  thinking  of  bases,  it  is  easy 
to  see  that  we  obtain  from  0  a  similar  representation 
G"- k>  for  matroid  M'f  as  follows.  Replace  node 
eitE  of  G  by  the  set  £*  of  new  nodes.  Join  each 
ej  eft’s  to  the  same  nodes  in  Q  to  which  et  was  joined. 
Also  add  to  Q  a  set  Q‘  of  A  —  1  new  nodes,  each  joined 
to  precisely  the  members  of  E*.  We  then  have  Cu-*’. 
A  base  of  matroid  DFJ-  **  consists  of  the  endpoints  in 
(£  -e,)  U  EJ  of  a  maximum  cardinality  matching  in 

Clearly,  if  A  C  E  for  matching  matroids  Mu,  a  and 
Mu.k.  then  M,;.  i  is  the  submatroid  of  Mu.  *  whose  set 
of  elements  i*  4.  Clearly,  any  submatroid  of  a  trans¬ 
versal  matroid  is  transversal. 
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Every  matching  matron i  is  a  transversal  matron!. 
(Thus,  the  two  classes  of  matroids  are  abstractly  the 

same.) 

In  view  of  ih.  preceding  observations  on  sub- 
matroids.  it  suffices  to  show  that  where  G  is  any  graph 
and  where  V  is  all  of  its  nodes,  Me.  »  is  a  transversal 
matroid.  Clearly,  B  is  a  base  in  matroid  Mr.,  i  if  and 
only  if  B  is  the  set  of  endpoints  of  some  maximum 
(cardinality)  matching  L  in  G. 

Section  6  of  [7]  implies  the  following  theorem  (which 
essentially  strengthens  some  other  known  theorems, 
a  characterization  by  Tutte  of  graphs  in  which  no 
matching  meets  all  the  nodes,  and  a  formula  by  Merge 
for  what  we  regard  here  as  the  rank  of  \1n.  »•). 

(*)  From  any  gruph  G,  by  deleting  the  set  J  of  nodes 
Hihich  meet  every  maximum  {cardinality (  matching 
and  deleting  all  the  edges  which  meet  J.  the  remainder 
consists  of  connected  components,  Q.  containing  re¬ 
spectively  2ii+l  nodes  where  r,  is  an  integer,  f  G 
is  bipartite,  each  0,  is  a  single  mode.)  Let  Q  consist 
of  the  nodes  u  in  J  which  in  G  are  joined  to  at  least 
one  node  in  U  0|.  Every  maximum  matching  in  G 
contains  r,  edges  in  0(,  for  each  i.  and  contains  an 
edge  joining  u  to  a  node  in  U  0,,/or  each  u«Q. 

What  is  actually  proved  in  |7|  is  theorem  (*)  where 
‘Every”  is  replaced  by  “Some”  in  the  last  sentence. 
However,  because  each  0,  has  an  odd  number  of  nodes, 
because  every  edge  leaving  an  0,  goes  to  a  ut(J,  and 
because  each  edge  has  two  ends,  it  is  easy  to  sec  that 
any  matching  which  is  not  as  described  in  the  theorem 
meets  fewer  nodes  in  U  0,.  Hem  e,  it  has  smaller 
cardinality  than  the  matching,  described  in  the  theo¬ 
rem,  which  is  proved  in  [7]  to  exist. 

(Unless  some  matching  in  G  meets  every  nod* ,  there 
are  more  OTs  than  there  are  us.  The  theorem  of 
Tutte  says  that  a  graph  contains  no  matching  that 
meets  all  of  the  nodes  if  and  only  if  there  exists  a  sub¬ 
set  Q  of  the  nodes  such  that  deleting  (J  anti  its  inci¬ 
dent  edges  from  G  leaves  more  than  i(Jl  components 
which  have  odd  numbers  of  nodes.) 


For  any  graph  G,  whose  node  sei  ia  V,  tfce  «st/  C  v 
defined  in  t*)  is  the  set  of  isolated  elements  in  matroid 
M<„ \  .  Denoting  the  set  of  nodes  in  Of  by  £i,  theorem 
(*)  says  that  each  maximum  matching  meets  all  but 
possibly  one  node  in  £t:  thus,  set  £*  is  in  series  in 
matroid  M,„  t  By  “contracting”  the  subgraphs  Of  to 
single  nodes  o,  comprising  a  set  £,  and  then  by  delet¬ 
ing  J  —  Q  and  all  edges  which  do  not  meet  an  ei,  we 
obtain  from  G  a  bipartite  graph  QE,  Q). 

Let  M„  be  the  transversal  matroid,  with  set  £  of 
elements,  associated  with  QE,  Q).  It  follows  easily 
from  theorem  (*)  that  matroid  Mr,,  v  is  obtained  from 
matroid  M„  by  replacing  each  e(  by  the  set  £i  in  series 
and  by  adding  set  J  of  isolated  elements. 

The  structure  of  transversal  matroids  and  some 
other  related  matroids  will  be  further  described  in  a 
later  paper. 
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Note  added  in  proof:  Theorem  I.  the  subject  of  (1|. 
generalized  here,  was  proved  for  ihe  case  where  the 
matroid  is  a  set  ol  vectors  in  a  vector  space  by  Alfred 
Horn  [  A  characterization  of  unions  of  linearly  inde¬ 
pendent  sets.  J.  London  Math.  Soc.  30  (1955),  494- 
496 1  and  by  H.  Rado  (A  combinatorial  theorem  on 
vet  lor  spat  es,  J.  Loudon  Math.  Soc.  37,  (1962),  351- 
353).  In  the  Abstracts  of  Short  Communications, 
International  Congress  of  Mathematicians,  Stockholm 
1962.  p.  47,  Rado  remarks  that  “This  theorem  is  of 
interest  since  in  contrast  to  other  propositions  on 
vector  spaces  its  proof  ha*  not  vet  been  extended  to 
abstract  independence  relations  I  (H.  Whitney,  Amer. 
J.  Math.  1935.  K.  Rado.  Canadian  J.  Math.  1949).  It 
remains  to  decide  if  (i)  the  theorem  is  true  for  all  1, 
or  <ii)  its  validity  constitutes  a  new  necessary  condi¬ 
tion  for  representability  of  I  in  a  vector  space.” 
Theorem  1  confirms  (it. 
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Cmmmmiealtd  bp  R.  CmrsM,  Dtetmbtr  ft,  1984 
We  will  refer  to  the  ordinary  linear  programming  problem 

maximise  i»  -  ex  (1) 

Ax  ■  J,  i  ^  0 

as  problem  PI.  In  (1)  b  is  an  integer  m-vector,  e  ia  an  ss  n  vector,  and  A  is  an 
m  X  (a*  +  *)  integer  matrix.  *  ia  an  s*  +  n  vector,  all  of  whose  oompooenta  are 
required  to  be  nonnegative.  We  assume  that  A  mot  the  form  (A1,  /)  with  /  an 
et  X  si  identity  matrix,  so  that  in  (1)  Ax  »  b  is  equivalent  to  the  si  inequalities  in  * 
variable id's'  £  6.  We  will  say  that  x  is  feasible  if  it  satisfies  the  equality  and  non* 
negativity  oooditionu  of  (1)  and  optima!  if  it  also  maximises. 

A  problem  closely  related  to  PI  is  the  integer  programming  problem  PS  which  ia 
Pi  with  the  added  condition  that  the  components  of  x  be  integers  Because  of  the 
comparative  earn  with  which  PI  ia  solved1  and  the  comparative  difficulty  of  PS,*-  * 
it  is  natural  to  oooaidor  getting  from  the  solution  of  PI  to  the  solution  to  PS  by 
some  sort  of  a  "rounding"  procam  through  which  the  nonintsgur  compensate  of  the  a 
solving  PI  are  rounded  either  up  or  down  to  produce  a  eolation  to  PS.  Tide 
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procedure  seems  particularly  plausible  when  the  components  x,  of  x  are  reasonably 
large  numbers.  However,  it  is  easily  shown  by  examples  that  a  nearest-neighbor 
rounding  process  cannot  generally  produce  the  optimal  solution  to  P2.  These 
examples  are  neither  pathological  nor  uncommon;  it  is  simply  not  the  case  that  the 
optimal  solution  can  be  obtained  by  simple  rounding  to  some  vector  z'  with  \x’t  — 
Zg|  <  1,  even  if  the  rounding  is  followed  by  some  sort  of  optimization  on  the  residual 
problem  and  even  if  the  b  and  x  of  (1)  become  arbitrarily  large. 

Nevertheless,  there  is  a  close  connection  between  the  optimal  solutions  to  PI  and 
P2  for  a  wide  range  of  right-hand  sides  b.  We  first  give  some  theorems  on  this 
connection  and  then  an  algorithm  which  for  these  6  obtains  the  optimal  solution  of 
P2  from  the  optimal  solution  to  Pi. 

If  B  is  a  basis,  i.e.,  an  m  X  m  nonsiugular  submatrix  of  A,  we  will  assume  that  A 
has  been  rearranged  and  partitioned  into  matrices  B  and  N  with  A  »  (Bfl). 
We  will  also  partition  z  *  ( x* ,  xN)  and  c  »  (c*,  cN).  The  columns  of  A  will  be 
referred  to  as  a{,  B  =•  (at,  . . a»).  We  confine  ourselves  to  right-hand  side 
vectors  6  in  that  part  of  m-space  for  which  (1)  is  solvable.  If  B  is  the  optimal  basis 
for  PI  with  right-hand  side  6,  then  it  is  also  the  optimal  basis  for  all  b'  such  that 
B~lb'  >  0.  These  6  form  a  cone  in  m-space,  and  in  fact  all  solvable  m-space  is 
partitioned  into  such  cones  KB.  On  removing  from  KB  all  points  within  a  distance 
d  of  its  boundary,  we  have  the  reduced  cone  KB(d).  With  this  notation  we  can 
now  state  Theorem  1. 

Thkorem  1.  Let  l  -  max  |!«,!|,  *  •  m  +  1,  . . .,  m  -f-  n ,  D  •  jdet  P!,<md*,(6) 
6e  the  value  of  the  eolution  to  Pi.  Then  if  btKB(l(D  —  1)),  the  value  *»{&)  of  the 
volution  to  PS  i*  given  by 

**(&)  -  *i(b)  +  **(b),  (2) 

and  an  optimal  volution  vector  iv  p-’vtn  by 

x(b)  -  (x.(6),  xAb))  -  (P->(6  -  Nfm,  y*(6)),  (3) 

where  both  the  vcaiar  function  o*{b)  and  the  n+ocior  function  y*(b)  art  m-penodic,  i  t., 
W*(h  +  ad  *  w*(6), »  -  1,  ,  m,  and  y*(b  +  «,}  -  y*(6), »  «  I, .  ,  m. 

The  periodicity  means  that  the  values  of  a*(b)  and  y*(b)  depend  only  on  the 
position  of  b  relative  to  the  lattice  £s  of  points  generated  by  integer  combinations  of 
«t,  This  is  equivalent  to  saying  that  **  is  r  function  on  the  factor 

module  hf(f)/M(B)  when  Af(f)  is  the  module  of  all  integer  points  in  m-space  and 
V(B)  the  module  of  integer  combinations  of  the  e,f  i  -  1,  . ,  m. 

Although  (2)  and  (3)  have  just  the  form  one  would  expect  if  rounding  were 
possible,  the  integer  solution  x(b)  is  generally  not  a  continuation  of  a  rounded 
nearest-neighbor  solution.  Iv  is  instead  a  continuation  from  a  point  p  «  6  - 
N |r*(5>  which on  £,.  A  measure  of  the  distance  from  p  to  6  is  given  by  Theorem 
2. 

Tuscan*  2.  If  biK*(l(D  -  /)),  then  the  optimal  volution  seder  z(6)  has  the 
property 

E  -  £  Vi  £  &  -  i. 

«••»  +  !  *•! 

We  next  dtscucs  the  arithmetic  work  involved  in  actually  obtaining  **(6)  and 
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y‘(b).  The  calculation  may  be  broken  into  two  parte,  of  which  the  first  is  a 
standard  calculation. 

The  factor  module  M{I)/M{B)  is  a  finite  additive  group  having  D  elements. 
By  the  methods  of  references  4  or  5  we  calculate  M(I)/M(JEf)  as  the  direct  sum  of 
cyclic  groups  of  known  orders.  The  arithmetic  work  involved  is  bounded  by 
2m(m*  +  2m)logjZ)  Or,  alternatively,  given  B~l  as  a  starting  point,  the  standard 
form  of  M(I)/M(B)  can  be  obumed  in  at  most  2r(m*  -f  2m)iogiJ)  arithmetic 
steps,  where  r  <  m  is  the  rank  of  \f(I)/M{B).  If  the  factor  group  is  cyclic,  r  is  1 . 

This  calculation  also  provides  explicitly  a  means  of  mapping  integer  m- vectors 
onto  corresponding  elements  of  M(I)/M(B) .  If  we  call  this  mapping  f,  then  f>  «=»  /p 
can  be  obtained  for  any  integer  m- vector  p  by  at  most  m1  +  2m  arithmetic  steps. 

Since  both  <e*(b)  and  yB(b)  are  m-periodic,  one  can  obtain,  by  periodicity,  values 
<p*(b)  and  y‘(b)  for  all  b  if  they  are  known  for  one  period. 

Theorem  3.  There  are  D  distinct  b  values  in  one  period.  If  M(1)/M(B)  has  been 
put  in  ata.idard  form  and  the  group  elements  a<  =  fa,  obtained  for  i  =  m  +  1,  . . . , 
n  +  n,  then  the  values  of  <pB(b)  can  be  computed  for  all  b  in  one  period  in  less  than  7nD 
elementary  arithmetic  steps.  The  values  of  y*(b)  for  a  particular  b  can  be  computed  in 
n  more  steps  or  the  values  for  all  b  in  one  period  in  nD  more  steps. 

The  arithmetic  steps  referred  to  here  are  operations  such  as  the  addition  and  sub¬ 
traction  of  real  numbers,  comparison  of  real  numbers,  or  the  addition  and  sub¬ 
traction  of  elements  of  M(I)/M(B).  We  now  turn  to  the  proofs  of  these  theorems. 

In  reference  2  it  was  pointed  out  that  M(I)/M(B )  is  isomorphic  to  the  group  F 
generated  by  the  rows  of  the  matrix  B~lA  with  the  entries  being  replaced  by  the 
corresponding  entries  modulo  1.  These  "fractional  rows”  then  provided  the  basic 
inequalities  for  the  methods  of  reference  2  and,  in  a  less  evident  way,  for  reference  3. 
As  was  remarked  in  reference  2,  similar  reasoning  shows  that  M(I)/M(B)  is  also 
isomorphic  to  the  group  generated  by  the  columns  of  B~'A  with  coefficients  being 
treated  the  same  way,  i.e.,  reduced  to  proper  fractions.  As  B~lA  is  the  simplex 
tableau  provided  by  the  simplex  method  in  solving  (1),  each  column  has  associated 
with  it  a  relative  cost  factor.  This  representation  suggests  the  following  problem 
involving  maximization  over  M(I)/M(B). 

i  «■» 

max  L  c*t+my, 

1 

»•»» 

E  *(+«!/<  «  5,  Vi  >  o  and  integer.  (4) 

i-i 

Here  a<  and  h  are  the  elements  of  M(l)/M(B)  corresponding  to  the  vectors  a,  and  b, 
and  c*t  is  the  relative  cost  c*t  *»  c,  —  cBB~'at. 

It  is  a  fundamental  property  of  linear  programming  that  all  c*t  associated  with  an 
optimal  basis  are  <0;  so  the  maximum  in  (4)  does  exist  for  optimal  bases  (though 
not  for  other  bases). 

Since  the  a{tAf(I)/M(B),  a  group  with  D  elements,  2)&(  -  6;  so  for  a  minimal 
solution  to  (4)  it  is  only  necessary  to  consider  y\  satisfying  0  <  yt<  D.  We  will 
indicate  later  how  (4)  can,  in  fact,  be  solved  for  all  5  in  a  total  of  7 nD  elementary 
steps. 

If  b  is  an  integer  m-vector,  define  <p{b)  as  the  value  of  the  solution  of  (4)  with  l  ** 
fb.  This  is  the  of  Theorem  1. 
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One  of  the  properties  of  ?  is  immediate.  Clearly,  ?(6  +  at)  «  <p(b),  i  «  1, . . . ,  m. 
Abo  v(b)  +  cgB-'b  >  *,(&).  For  >f  (x'*,*'w)  is  &  feasible  irteger  solution  to  (1), 
then  the  corresponding  cost  c(x'b,  x'n)  ~  cgx'g  +  Cnx'n  can  be  expressed  in  terms 
of  the  x'N  only  by  using  the  relation 

Bt'b  +  Nx'„  ~  b,  (5) 

which  yields 

c{x'b,  x's)  =  rBB~]b  +  (cN  -  c«B-‘A0xV 

is  »i(b),  so 

c(x'«,  t'at)  =  Z\(b)  +  E  «VV  (6) 

Applying  the  homomorphism  /  to  (5),  B  disappears  and  we  get 

i*m  4*« 

2  i  0, 

t  sr  . 

so  a/w  is  a  feasible  solution  (4).  Heme 

«  “■«  +n 

E  cV'.  <  w(6); 

«  -m  +  l 

so  from  (6),  c(x’B,  x'n)  <  'i(b)  4-  <p(b),  and  since  this  holds  for  all  such  (x'B,  x'n), 

*(&)  <  zi(b)  +  *(&).  (7) 

Now  iet  us  consider  the  y  that  solves  (4).  For  this  y 

%  ■» 

E 

»-i 

We  extend  this  to  a  solution  to  (5)  by  choosing  xB  -  5_,(6  -  Ny)  Since  y 
solves  (4),  (b  —  By)t£B,  so  xB  will  be  integral.  If  xB  is  also  nonnegative,  (xB,  y) 
solves  (1)  and,  in  fact,  is  the  optimal  integer  solution  since  its  cost,  by  (6),  is ei(l)  4* 
*(6),  which,  by  (7),  establishes  optimality. 

To  establish  conditions  for  the  nonnegativity  of  xB  we  need  the  following  lemma. 
Luma.  There  it  an  optimal  solution  to  (4)  with 

Z  Vi<D-l. 

i-i 

Proof:  If  the  y<  are  the  components  of  that  optimal  solution  having  E?<  minimal, 
form  any  sequence  of  the  a,+„  in  which  each  a,+w  appears  exactly  yt  times.  Then 
form  the  partial  sum  S,  of  the  first  p  elements  of  the  sequence.  We  include  5*  »  6. 
If  the  sequence  has  more  than  D  —  1  elements,  there  are  more  than  D  so  there 
must  be  a  p  and  p',p<p'  for  which  S,  *  S9>.  The  elements,  between  p  and  p'  in  the 
sequence  total  6  and  can  be  deleted.  The  remaining  elements  form  a  new  solution 
which  contradicts  either  optimality  or  minimal  total  Elf<- 
A  related  argument  can  be  used  to  prove  the  following  which,  though  not  used  in 
the  proofs  of  Theorems  1,  2,  or  3,  bears  on  the  multiplicity  of  solutions  to  integer 
programs. 
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I  -» 

Thkorcm  4.  If  H  (y,  +  1)  >  D  —  1,  then  the  eolution  to  (4)  vs  not  unique. 

t 

Returning  to  the  proof,  it  follows  from  the  lemma  that  jjJVyjl  <  (D  —  1)1.  Hence, 
if  h«K*(l(D  -  1)),  6  -  Ny  will  be  in  Ka  and  so  x»  will  be  nonnegative.  This 
establishes  Theorem  1. 

Theorem  2  now  follows  at  once  from  the  lemma. 

To  establish  Theorem  3,  we  now  turn  to  the  actual  computation  of  y.  Guided  by 
dynamic  programming,*  we  define  <?,(P)>  p*M(I)/M(B)  as  the  solution  to 

i  -* 

max  £  c*4+«J/< 

<-i 

i  *• 

22  8<+*y<  ”*  P- 

t-i 

We  have  recursively 

<p,(J>)  =  max  -  ai+«)  +  c*,+«,  ?,-:(£)}.  (8) 

Let  us  assume  that  <p,-i(P)  is  known  for  all  PtM(I)/M(B).  Then  we  can  compute 
recursively,  starting  with  <p,{p)  =  0, 

*>(».+-)  -  max  {#>,(©) +e  *»+«»>  ?,_»{«,+„) } 

*(«!*«)  »  max  {*,(ra.+„  -  »,+»)  -f  cV-,  ?,-!(«»,+«>  ),r  *  1, 2,  . .  ,,D  -  1. 

If  a,  is  of  order  D,  we  will  obtain  all  values  <fi,(P)-  If  a.  is  of  some  order  d  which 
divides  D ,  then  da,  «  o,  and  after  d  steps  we  return  to  5.  One  then  choc  see  eome  f> 
not  yet  reached  in  the  calculation  (the  standard  form  of  the  group  is  needed  here), 
and  setting  <f>',(p)  =  <p,-dP)  computes  <f>',(P  +  ra,+»)  »  max  { +  r\+m  — 
an-)  +  c*rt«,  vdp  +  rat+J|  r  -  1,  .  . ,  d.  After  d  steps,  da,+„  »  6,  so  we  ob¬ 
tain  a  new  value  for  v'dp)  and  then  continue  obtaining  new  values  for  ?',(£  -f  ra,+«) 
the  second  time  around.  As  soon  as  one  of  these  new  values  agrees  with  the  old,  the 
calculation  is  stopped.  It  is  not  hard  to  sho^  that :  (t)  th^  calculation  will  stop 
after  q  steps  d  <  q  <  2d;  (it)  the  <p\(p  +  ra,+-)  values  are  the  correct  values 
<fi,(p  +  r&i+m)-  This  procedure  is  repeated  for  D/d  starting  points  p  to  get  values 
<fi,(jP)  for  all  ptM([)/M(B). 

If  Af  <  min  c*„  we  can  start  with  <pt(f)  -  M  for  all  p.  Then  repeating  the 

t 

calculation  leads  to  the  calculation  of  *9(p)  for  all  p  in  at  most  2nD  elementary 
recursions  each  involving  adding  two  group  elements,  looking  up  two  values,  adding 
two  real  numbers,  and  making  one  compare.  To  obtain  the  optimal  solution  with 
the  smallest  one  simply  records  with  each  »,(6).  when  computed,  the  total 

i 

T,(P)  ->  of  the  y(  of  that  solution.  Clearly,  T,(P)  «  T,_dP)>  if  the  second  lerir 

I 

gives  the  maximum  in  (8)  and  T,(p )  “  T,(P  —  *#+•)  4- 1,  otherwise.  In  case  of  a  tie 
in  the  maximum  in  (8) ,  the  term  yielding  the  smaller  T,(P)  value  should  be  chosen. 

The  solutions  y{  are  obtained  by  tracing  back  the  recursion  in  the  usua1  manner  of 
dynamic  programming.  By  proper  recording,  backtracking  can  be  done  even  if  the 
?r-i  values  are  discarded,  once  the  <p,  are  known.  These  backtracking  operations 
are  virtually  identical  with  those  used  in  solving  the  knapsack  problem,*  and,  in  fact, 
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work  done  with  P.  C.  Gilmore  on  knapsack  problems  strongly  suggested  the  results 
of  this  paper. 

Finally,  we  note  that  the  following  steps,  (t)  solve  PI  obtaining  the  opitmal  B, 
(it)  put  M(I)/M(B )  into  standard  form  and  identify  the  &t,  ( m )  solve  (4)  obtaining 
y,  (iv)  compute  x$  *  B~\b  —  Ny),  will  yield  an  optimal  solution  (*#,  y)  if  x§  >  0. 
It  is  not  necessary  for  6  to  be  in  K‘(l(D  -  1))  to  apply  the  procedure.  The  prob¬ 
lems  for  which  the  procedure  provides  a  solution  are  those  for  which  those  in¬ 
equalities  binding  the  solution  of  PI  alone  determine  the  solution  to  P2. 

*  This  work  was  supported  in  part  by  the  Office  of  Naval  Research  under  contract  Nonr  3775- 
(00),  NR  047040. 

1  Dantiig,  George  B.,  Linear  Programming  and  Retention*  (Princeton,  N.  J.:  Princeton  Uni¬ 
versity  Press,  1963). 

*  Gomory,  R.  E.,  “An  algorithm  for  integer  solutions  to  linear  programs,”  Recent  AdeanCe*  in 
Mathematical  Programming,  ed.  R.  L.  Graves  and  Philip  Wolfe  (McGraw-Hill,  1963),  pp.  360-302. 

*  Gomory,  R.  E.,  “All-integer  integer  programming  algorithm,  ”  Industrial  Scheduling,  ed.  J.  P. 
Muth  and  G.  L.  Thompson  (Prentice-Hall,  1963),  pp.  193-206. 

4  Zaesenhaus,  Hans,  The  Theory  of  Croupe  (New  York:  Chelsea  Publishing  Co.,  1949). 

*  van  der  Waerden,  B.  L.,  Modem  Algebra  (New  York:  Frederick  Ungar  Publishing  Co.,  1950), 
vol.  2. 

*  Bellman,  R.,  Dynamic  Programming  (Princeton,  N.  J.:  Princeton  University  Press,  1957). 

1  Gilmore,  P.  C.,  and  R.  E.  Gomory,  “Multi-stage  cutting  stock  problems  of  two  and  more 
dimensions,’’  to  appear  in  Operation*  Re*. 


Reprinted  from  the  Proceedings  or  the  National  Academy  or  Sciences 
Vol.  87,  No.  1,  pp.  18-18.  Jenuery,  1067. 


FACES  OF  AN  INTEGER  POLYHEDRON * 

By  R.  E.  Gomory 

THOMAS  J.  WATSON  RESEARCH  CENTER,  YORE  TOWN  HEIGHTS,  NEW  YORK 
Communicated,  by  H.  Courant,  November  1, 1866 

In  reference  5  a  connection  was  given  between  the  integer  and  noninteger  solu¬ 
tions  to  the  linear  programing  problem 

maximize  z  =  ex  ( 1 ) 

Ax  —  b,  x  >  0. 

In  (1)  x  is  an  m  +  n  vector,  6  is  an  integer  m- vector,  c  an  m  -j-  n  vector,  and  A  an 
m  X  (m  +  n)  integer  matrix  containing  an  m  X  m  identity  matrix.  A  is  assumed 
to  be  rearranged  and  partitioned  into  an  m  X  m  optimal  basis  matrix  B  for  the 
noninteger  problem  and  a  collection  of  nonbasic  columns  forming  the  matrix  N 
with  A  **  (B,N).  An  alternative  form  of  (1)  that  is  useful  here  for  geometric  inter¬ 
pretation  is  to  revert  to  inequalities,  writing  A  as  (A',  I).  Then  (1)  becomes 

maximize  z  «*  c'x'  (la) 

A'x'  <  b 

where  x’  and  c'  are  n-vectors. 

Under  suitable  conditions,  given  in  reference  5,  the  integer  solution  to  (1)  could 
be  obtained  from  the  noninteger  one  by  solving  the  optimization  problem 

»  — H 

max  £  c*t  t 

iml 

subject  to  the  conditions 

£  Qtt i  -  0o  (2) 

i-i 

where  the  tt  are  required  to  be  nonnegative  integers.  The  c*  (which  are  not  im¬ 
portant  here)  are  the  relative  cost  coefficients  associated  with  the  columns  of  N,  and 
Qt  is  the  element  of  the  factor  module  g  *>  M(I)/M(B)  corresponding  to  the  ith 
column  of  N.  Here  M{I)  is  the  module  of  all  integer  m- vectors,  and  M(B)  the 
module  generated  over  the  integers  by  the  columns  of  B. 

The  connection  between  integer  and  noninteger  solutions  established  by  (2) 
held  only  under  certain  conditions.  One  way  to  develop  this  approach  into  a 
general  integer  programing  algorithm  would  be  to  develop  from  (2)  new  inequalities 
or  “cutting  planes"  for  a  method  similar  to  that  of  reference  6.  The  geometrical 
interpretation  of  the  solutions  to  (2)  suggests  that  this  is  possible  and  this  approach 
is  outlined  here. 

To  see  this,  consider  the  cone  P'  in  the  spaoe  of  the  variables  x'  of  (la)  formed  by 
using  only  the  inequalities  corresponding  to  the  nonbasic  variables  or  equivalently, 
P'  is  the  cone  obtained  if  in  (la)  the  nonnegativity  condition  for  the  basic  variables 
is  dropped.  Within  P'  is  the  polyhedron  Pf  which  is  the  convex  hull  of  the  integer 
points  of  P’.  P"  is  an  interesting  object  of  Btudy  in  itself.  In  addition,  its  faces 
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clearly  provide  the  strongest  inequalities  or  cutting  planee  for  the  general  integer 
programing  problem  that  can  be  deduced  locally,  i.e.,  without  using  the  nonlocal 
information  available  from  the  nonnegativity  condition  on  the  basic  variables 

Since  the  variables  t  of  (2)  determine  a  corresponding  x  satisfying  the  equations 
of  (1)  by  t-+x  *  ( B~l(b  —  Nt),t )  and  hence  also  determine  the  x‘  of  (la),  inequalities 
on  the  U  yield  inequalities  on  the  x',  and  in  this  sense  one  can  talk  about  an  in¬ 
equality  on  the  U  being  a  face  of  P'. 

Faces  to  P"  can  be  characterized  by  the  following  easily  proved  theorem  which 
allows  their  computation  by  linear  programing. 

Theorem  1.  The  inequality  Xrttt  >  tt0  is  a  face  of  P *  if  and  only  tf  the  t,  are  a 
basic  feasible  solution  of  the  system  of  inequalities, 

■kT  >  To  (3) 

made  up  from  all  vectors  T  =  ( t, ,  . . . ,  <„),  satisfying  the  equations  of  (9). 

A  number  of  remarks  can  be  made  about  the  feasibility  of  this  computation. 

First:  Although  there  are  an  infinity  of  T  satisfying  (2)  and  hence  an  infinity  of 
inequalities,  it  is  easy  to  reduce  this  to  a  finite  number  by  considering  only  the  ir¬ 
reducible  T,  i.e.,  those  T  not  containing  as  a  vector  V  *  (tif  . . .,  f„)  for  which 
«  0.  Or  alternatively  one  can  work  only  with  those  T  that  satisfy  (2)  for 
some  nonnegative  c*. 

Second:  The  trivial  faces  of  Pw,  those  that  are  simply  faces  of  the  original  prob¬ 
lem  (1),  can  be  discarded  by  choosing  r0  **  0. 

Third:  The  multiplicity  of  rows  in  (3)  can  be  dealt  with  by  a  row-generating 
method  similar  to  the  methods  of  references  1 , 3, 4,  and  7.  Because  of  this,  annXn 
basis  matrix  is  the  most  that  is  required  at  any  time.  The  needed  row  at  each 
simplex  step  can  be  generated  (at  worst)  by  solving  a  problem  approximately 
equivalent  to  (2),  essentially  a  shortest  path  problem  over  the  group  0. 

Fourth:  There  is  a  simple  way  of  getting  a  first  feasible  solution  to  (3),  and  hence 
a  face  of  P ",  by  solving  a  single  problem  like  (2)  with  an  additional  side  calculation 
that  less  than  doubles  the  work.  (For  an  estimate  of  the  work  involved  in  solving 
(2),  see  ref.  5.)  This  calculation  will  not  be  described  here. 

Fifth:  Duplication,  i.e.,  many  columns  of  N  mapping  into  the  seme  group  ele¬ 
ment,  can  easily  be  taken  care  of  and  simply  reduces  the  size  of  the  problem  to  be 
dealt  with. 

In  addition  to  providing  a  method  of  computing  faces  of  P*  by  linear  program¬ 
ing,  equations  (3)  lead  to  the  proof  of  the  following  theorems  which  involve  con¬ 
sidering  the  tree  of  shortest  paths  over  the  group  G. 

Let  gt,  . . . ,  gn  be  the  group  elements  corresponding  to  the  columns  of  N.  Choose 
from  among  the  gt  a  basis  (we  can  assume  it  is  git  . . . ,  g„)  so  that  G  »  0i  0  0*  0 
. . .  0  g9,  the  direct  sum.  The  remaining  gt  and  the  element  gt  corresponding  to  b 

«»F 

can  then  be  represented  as  p- vectors  with  respect  to  this  basis;  so  gk  —  £  y*,<g< 

and  the  following  theorem,  whose  proof  is  not  given  here,  holds. 

Theorem  2.  If  for  some  basis  gx, . . . ,  g„  the  representation  of  go,  go  m  2 ytt(gi  has 
a  component  s  for  which 

7o.i  >  max*  >  ,  yk,„ 

then  the  rt  given  by 
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*•#  »  yo,„  r,  *  1,  Tt  «  >  p,  and  wk  -  0 

otherwise  give  a  face  of  P *. 

The  condition  of  the  3  always  met  whenever,  for  some  component  «, 

7*.f  is  exactly  one  lees  than  the  order  of  g,.  In  particular,  we  have  the  corollary. 

Corollary.  If  g  ie  the  direct  sum  of  cyclic  groups  of  order  E,  then  any  basis  of  g 
satisfies  the  conditions  of  Theorem  B. 

It  is  easily  shown  that  all  A  consisting  of  columns  with  at  most  two  nonzero  en¬ 
tries  that  are  restricted  to  be  1,  or  —1,  yield  groups  G  that  are  direct  sums  of 
cyclic  groups  of  order  2.  This  connects  with  the  work  of  Edmonds.1 

In  the  next  theorem  we  refer  directly  to  the  shortest  path  tree.  In  a  graph,  if 
any  unambiguous  method  of  breaking  ties  among  paths  is  used  so  that  there  is  a 
unique  shortest  path  between  two  points,  the  shortest  paths  from  one  point  to  all  the 
others  will  form  a  spanning  tree.  However,  different  tie-breaking  methods  produce 
different  trees.  If  the  elements  of  g  are  taken  as  nodes  of  a  graph,  and  if  the  gu  i 

-  1 . .  are  taken  as  directed  arcs  connecting  each  gf  to  the  point  g'  +  g(l  we 

have  a  graph  and  hence  for  any  choice  of  *■<,  *  «  1,  . . . ,  n,  a  shortest  path  tree. 

In  what  follows,  b'  denotes  a  right-hand  side  in  (1)  and  g'  is  the  group  element  fb' 
under  the  natural  mapping/  which  sends  Af  (/)  onto  G.  We  can  now  state  Theorem 
3. 

Theorem  3.  If  R  is  a,  shortest  path  tree  for  the  r(  forming  a  face  of  P",  and  if  g' 
m  fb'  is  separated  from  0  in  R  by  go,  then  the  rt  are  also  a  face  for  the  polyhedron  P" 
resulting  from  the  right-hand  side  b'. 

The  »#'  corresponding  to  6'  can  be  obtained  by  adding  the  tree  distance  from  g0 
to  g'  to  the  original  t0. 

Finally  we  state  a  theorem  that  allows  the  computation  of  faces  on  a  once-and- 
for -all  basis,  independent  of  the  particular  columns  present  in  the  matrix  M . 

Let  the  faces  F}  be  all  faces  of  the  higher-dimensional  integer  polyhedron  P *  ob¬ 
tained  from  (2)  by  letting  the  index  i  in  (2)  range  over  all  group  elements. 

Fi  "  (*■«■<»  vi.<,  *■*,(,  •  ••  v -  (4) 

Theorem  4.  The  faces  F/,  obtained  by  deleting  from  (4)  all  rfi  whose  corresponding 
group  element  is  not  a  column  of  N,  include  all  faces  of  the  original  polyhedron  P*. 

*  This  work  wm  supported  in  part  by  the  Office  of  Naval  Research  under  contract  Nonr  3775- 
(00),  NR  047040. 
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ON  COMPLEMENTARY  PIVOT  THEORY 


by 

C.  E.  Lemke 


INTRODUCTION 

These  notes  constitute,  in  the  main,  an  exposition  of  the 
results  contained  in  References  anu  £  5  and  further, 

incorporate  some  thoughts  of  the  author  and  resu'ts  of  others 
which  have  accrued  since.  An  effort  is  made  to  make  the  present¬ 
ation  simple  and  instructive,  to  stress  the  reliance  on  and 
relation  to  well-known  procedures  of  linear  progranming,  and 
to  emphasize  the  unifying  and  generalizing  aspects  of  the  results. 

We  are  concerned  with  the  existence  and  computation  of 
solutions  to  the  following  system  of  constraints: 

(1)  w  *  q  +  Mz; 

(2)  w^0;z^0; 

(3)  v'z  -  0  (-  Vj*!  +  w2l2  +  ...  +  wn*n); 

for  various  given  square  matrices  of  order  n,  and  column  q. 

Prime  denotes  matric  transposition.  The  above  problem  involves 
2n  variables,  restricted  to  be  non-negative,  (w^,  z^)  is  a 
complementary  pair;  and  w^  and  are  complements  of  one 

another.  A  feasible  solution  for  which  at  least  one  of  each 
pair  is  equal  to  zero  (that  is,  w^z^  *  for  each  i;  which  is 
equivalent  to  (3)  )  is  a  desired  complementary  solution.  We 
shall,  following  Cottle  and  Dantzlg  (Ref. £2  J),  refer  to  the 
problem  of  finding  solutions  to  this  system  as  che  ’'Fundamental 
Problem". 

The  form  is  fairly  general  in  che  sense  that  many  problems 
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nay  be  so  posed.  Outstanding  examples  are  (as  shown  in  the 
appendices)  the  problem  of  solving  tve  convex  quadratic  pro¬ 
gramming  problem  (which  includes  the  linear  programming  prob¬ 
lem),  and  the  problem  of  finding  a  Nash  equilibrium  point  for 
bimatrix  games.  The  results  of  Section  B  encompass  the 
construction  and  proof  of  solution  for  these  and  other  prob¬ 
lems.  The  computational  schemes  are  'complementary  pivot 
schemes',  which  consist  of  a  sequence  of  pivots  starting  from 
system  (1),  or  from  that  system  augmented  by  a  single  additional 
variable.  This  extends  the  use  of  such  schemes. 

A.  PRELIMINARIES. 

In  this  section  we  develop  concisely  the  well-known 
essentials  of  pivotal  schemes  in  a  form  most  useful  to  the 
development  in  Section  B,  where  the  schemes  and  proofs  are 
given. 

One  is  basically  concerned  with  a  system  of  m  linear 
equations  in  m+n  unknowns: 

(4)  .  -  q  .  te;  (-  q  *  Vl  *  V,  ♦ 

where  A  has  order  n  by  n.  Bars  above  variables  denote 

explicit  values  of  the  variables.  A  set  of  m+n  values 

(w,  1)  is  a  solution  if  and  only  if  it  satisfies  (4):w  -  q  +  fa. 

A  solution  is  feasible  if  and  only  if  it  is  non-negative: 

(5)  w>  0;  and  z  >  0. 

L  denotes  the  set  of  solutions;  K  the  set  of  feasible  solutions. 
K  may  be  empty.  Geometrically,  L  is  a  linear  manifold  of 
dimension  n  in  (m+n)-space.  K  is  a  convex  polyhedron  (an 
Intersection  of  half-spaces).  Throughout,  geometric  descrip¬ 
tions  are  not  necessary  to  understanding,  and  may  be  ignored. 
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Ir.  (4),  L  is  described  by  giving  the  right-side 
(independent)  variables  values  and  computing  the  left-side 
(dependent)  variable  values.  We  are  interested  in  alternative 
descriptions  of.  L  having  the  same  form  and  obtained  from  (4). 

Aj  denotes  the  jth  column  of  A;  A ^  ^  denotes 
the  component  in  row  i  and  column  j  of  A.  Suppose  that 

(6)  As  r  /  0. 

We  may  then  perform  a  pivot  on  the  form  (4)  which  consists  of: 

(i)  solving  the  sth  equation  of  (4j  for  the  variable  zr  (which 
involves  a  division  by  Ag  r;  called  the  pivot ) ,  and 

(ii)  replacing  zr  by  the  result  in  each  of  the  remaining 
m-1  equations  of  (6). 

The  result  of  the  pivot  is  again  a  form  like  (4), 
giving  an  equivalent  description  of  L.  The  left-hand  set 
of  m  variables  differs  from  the  previous  left-hand  set  in 
one  component ;  the  pair  (wg,  zf)  have  been  exchanged ;  and 
specifying  th*n  pair  (called  the  pivot  pair  )  completely 
specifies  the  pivot. 

We  may  then  perform  a  pivot  on  the  result;  and  continue 
in  a  sequence  of  pivots.  A  pivotal  scheme  on  (4)  consists 
of  a  sequence  of  pivots  together  with  criteria  fo."  (a)  deciding 
the  pivot  pair  on  each  pivot  and  (b)  deciding  when  to  termin¬ 
ate  the  pivoting. 

A  Sequence  of  pivots  leads  to  an  equivalent  description 
of  L  given  by  the  resulting  fora: 

<7)  - 1‘  ♦  ***‘f  <-  <»*  *  44  *  44  *•••*44  >: 

so  that  (»*,  zt)  is  always  some  permutation  of  the  variables 
(w,  z).  We  now  accumulate  the  relevant  definitions  and  facts. 
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A  basic  set  is  a  set,  call  it  w*,  of  m  of  the  n+m  variables 

(%,  z)  such  that  wt  may  be  obtained  from  (4)  in  the  form  (7) 

(and  by  a  sequence  of  pivots).  The  corresponding  set  z"  of 

the  remaining  n  variables  is  the  associated  non-basic  set. 

Given  the  basic  set  w*,  the  form  (7)  is  unique,  except  for 

the 

permuting  the  equations  and/or  the  terms  of /tight -most ,  ex¬ 
panded  expression  for  wt .  (7)  is  called  a  basic  form, 

if  one  has: 


(8) 


r ,  s 


0 


then  a  pivot  on  (7)  defined  by  the  pivot  pair  (w„,  z^)  yields 
a  new  basic  form  called  adjacent  to  (7).  Conversely,  two 
basic  forms  are  adjacent  if  and  only  if  their  basic  sets 
differ  by  only  ore  variable,  (hence  may  be  obtained,  one 
from  the  other,  by  a  single  pivot).  It  follows  that  a  basic 
form  (7'  has  as  many  adjacent  basic  forms  as  there  are 
non-z  vo  components  in  A*.  Two  basic  sets  are  adjacent  if 
th  ir  corresponding  basic  forms  are  adjacent,  (the  equivalence 
of  basic  forms  by  permuting  equations  or  non-basic  variables 
is  always  assumed). 

Referring  to  (7),  the  unique  point:  (w  ,  z  )  =  (q  ,  0), 
obtained  from  (7)  by  setting  all  non-basic  variables  z*  to 
zero  is  called  the  associated  basic  point. 

A  basic  point  has  at  least  n  zeros  (since  "z*  =0). 

L  is  non-degenerate  if  and  only  if  any  solution  (w,  "z)  has 
at  most  n  of  the  m+n  values  w^,  ~z ^  equal  to  0.  Alternatively 
and  equivalently,  L  is  non-degenerate  if  and  only  if  every 
basic  point  has  exactly  n  zero  values  (exactly  m  non-zero 
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values).  We  assume  throughout  that  L  is  non-degenerate. 

We  shall  however  always  make  note  of  any  effects  of  non-de¬ 
generacy.  An  immediate  effect  is  the  following: 

A  solution  has  exactly  n  zero  components  if  and 
only  if  it  is  a  basic  solution;  hence  corresponding  to  each 
basic  set  is  one  and  only  one  basic  solution:  and  each 
basic  solution  corresponds  to  one  and  only  one  basic  set. 

Hence,  in  each  basic  form  (7)  q*  has  no  zero  components. 

Two  basic  solutions  are  adjacent  if  their  basic 
sets  are  adjacent. 

Referring  to  (7),  for  fixed  j  all  solutions  satisfying: 

(9)  wt  =  qt  +  z*A* 

3  J 


will  be  called  a  basic  line;  obtained  from  (7)  by  taking 

all  non-basic  variables  except  zt  equal  to  zero.  Points 

on  a  basic  line  have  either  n  or  n-1  zero  components 

(m  or  m+1  non-zero  components).  If  in  (9)  some  value  of  zt 

J 

makes  a  component  of  j*  zero,  the  corresponding  solution 


has  exactly  n  zero  components;  hence  is  a  basic  solution 


and  in  fact  an  adjacent  solution  to  the  basic  solution 
t  t,  t 

(w  ,  z  )  =  (q  ,  0) ,  corresponding  to  (7).  Hence,  all 
adjacent  solutions  lie  on  basic  lines.  With  reference  to  (9) 


for  future  reference  let  us  (i)  let  e.  be  a  column  whose 

3 

jth  component  is  whose  other  components  are  0;  (ii)  write 


(9)  as: 

(10) 


and  re-parmute  variables  to  their  original  order. 


(10)  become $ 
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where  (w,  z)  is  still  the  basic  solution  corresponding  to 
(7).  In  order  that  this  satisfy  (4)  for  all  0: 

(12)  v  -  Au  | 

and  (v,  u)  has  at  least  n-1  zero  values. 

Our  next  remarks  refer  to  feasibility  and  the  set  K. 
A  feasible  basic  form  is  a  basic  form  whose  basic  point  is 
feasible,  called  a  basic  feasible  point.  We  indicate  such  a 
form  by: 

(13)  w*  =  q*  +  A^z^;  q*  0 . 


A  basic  feasible  point  is  an  extreme  point  of  K.  A  feasible 


pivot  is  a  pivot  from  one  feasible  basic  form  to  another 
feasible  basic  form.  We  point  out  that  corresponding  to 
each  non-basic  variable  z*  at  most  one  feasible  pivot  from 

J 

(13)  is  possible.  Referring  to  (9),  now  restricting  z*  to 

J 

non  negative  values,  since  q*  >  0,  z*j  may  be  increased 
from  0  while  retaining  w*  ^  0. 


time,  say  for  z*  =  z*  some  component,  say  w* 


Either  for  the  first 

becomes  0| 

in  which  case  the  corresponding  point  is  an  adjacent  feasible  basic 

point,  and  a  feasible  pivot  on  (13),  defined  by  the  pivot 

pair  (w^,  z^)  may  be  performed;  or  else  solutions  satisfy- 
s  j 

ing  (9)  are  feasible  for  all  z*j  ^  0;  which  is  true  if  and 

only  if: 


(14) 


A1  > 

J  - 


0. 


In  the  former  case,  solutions  satisfying  (9)  for  the  interval 
0  <.  z“  <  z*  are  feasible,  and  form  a  bounded  edge  of  K, 


<  *5  < 


whose  two  end-points,  defined  by  the  end-values  of  the  interval, 


are  called  adjacent  extreme  points. 


In  the  latter  case, 
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all  solutions  satisfying  (9)  for  z*  are  feasible  and 
form  an  unbounded  edge  of  K.  In  particular,  exactly  n 
edges  of  K  meet  in  an  extreme  point;  and  each  extreme  point 
has  at  most  n  adjacent  extreme  points  —  each  feasible  basic 
form  has  at  most  n  adjacent  feasible  basic  forms. 

A  feasible  pivotal  scheme  consists  of  a  sequence  of 
feasible  pivots.  In  these  notes  we  will  be  limited  to  feas¬ 
ible  pivotal  schemes  such  that,  given  a  feasible  form  (13), 
a  non-baaic  variable,  say  z*,  is  first  selected.  The  pivoting 
terminates  if  A*  >  0;  othercwise  the  pivot  is  uniquely 
defined  by  z*.  Geometrically,  a  feasible  pivotal  scheme 
defines  an  adjacent-extreme-poifct  path. 

Finally,  we  define  a  proper  pivotal  scheme  as  one 
for  which  no  basic  set  appears  twice.  Since  the  number  of 
basic  sets  is  finite,  a  proper  pivotal  scheme  always  termin¬ 


ates. 
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B.  Complementary  Pivot  Schemes  and  Applications. 

Ve  now  consider  the  Fundamental  Problem,  (1),  (2),  (3). 
We  consider  three  sfchemes  which  are  essentially  alike.  Each 
scheme  consists,  except  for  one  or  two  initial  pivots,  of 
feasible  pivots.  Schemes  II  and  III  apply  directly  to  L. 
Scheme  I  requires  augmenting  L  by  one  (pseudo)  variable. 

All  schemes  are  proper  schemes;  hence  terminate.  Following 
the  description  of  each  scheme;  a  class  of  problems  for 
which  the  scheme  applies  is  defined;  and  proofs  are  given. 

L  is  now  the  set  of  solutions  to  (1);  K  is  the 
set  of  non-negative  solutions.  L  is  assumed  to  be  non¬ 
degenerate.  A  cpmplementary  solution;  reeqniring  Wj^  *  0 
thus  has  at  least  n  zero  components;  by  non-degeneracy 
at  most  n  zero  components;  hence  exactly  n;  and  hence  is 
a  basic  feasible  point.  The  number  of  complementary  solutions 
is  thus  finite.  Let  C  denote  the  set  of  complementary 
solut ions. 

Scheme  I.  Let  zQ  be  a  scalar  variable;  e  >  0  denote 
any  column  with  positive  components;  and  let  L'  denote 
the  set  of  solutions  to: 

(15)  w  -  q  +  zQe  +  Mz  -  q  +  Az; 

where  A  -  (e,  M) ;  z  -  (z°) ’  Non-basic  sets  thus  have  n+1 
components. 

Let  K'  be  the  set  of  feasible  solutions  of  L’. 

Let  CQ  denote  the  set  otfpoints  of  K’  satisfying: 

$ 

0 

(16)  w’z  -  0  (that  is,  w^  -  0;  i  *  1,  2,  ...,  n). 


We  shall  generate  a  proper  feasible  sequence  whose  basic 
points  satisfy  (16). 

Either  q  ^  0,  in  which  case  the  basic  point  of  (15) 
is  already  a  complementary  solution  for  L.  and  no  pivots  are 
required;  or  else  q  has  some  negative  component,  which  we 
assume  to  be  the  case. 

On-  the  first  pivot,  zQ  is  increased  until  for  the 
first  time;  w  -  q  +  zQe  ^  0;  in  which  case  some  wf  be¬ 
comes  zero.  The  first  pivot  is  defined  by  the  pivot  pair; 

(17)  (wr,  zQ). 

This  leads  to  the  basic  form: 

(18)  w1  -  qt  +  A V;  qt>  0, 

for  t  «  1  The  basic  feasible  point  satisfies  (16).  Assoc- 

complementarfr 

iated  with  (18)  is  a  non- basic/ pair ;  in  the case  t  -  1 
the  pair  (wf,  zf) .  As  zr  the  complement  of  wr  is  increased 
in  (18)  (16)  remains  satisfied.  If  a  pivot  making  zr  basic 
can  be  made;  this  pivot  becomes  the  second  pivot,  and  leads 
to  the  feasible  form  (18),  for  t  =  2.  If  the  pivot  cannot 
be  made,  the  sequence  is  terminated. 

In  general,  suppose  that  t  ^  1  pivots  have  lejLd  to 
the  feasible  form  (18);  and  suppose  that  (16)  was  satisfied 
for  all  basic  feasible  points  generated.  If  zQ  is  non-basic 
a  complementary  solution  has  been  found;  namely  the  final  basic 
point,  and  the  sequence  terminates.  If  zQ  is  still  basic, 
suppose  that  the  variable  that  became  non-basic  on  the  tth 
pivot  was  one  of  the  complementary  pair  (we,  z _) .  Since 
condition  (16)  prevailed,  both  components  of  this  pair  are 


non-basic.  By  counting,  this  is  the  only  such  pair  --  the 
non-basic  complementary  pair  associated  with  the  current 
basic  form. 

The  complement  of  the  one  of  the  pair  which  just 
became  non-basic  is  to  be  increased.  Either  a  unique 
t+lst  pivot  is  thus  specified;  or  the  sequence  is  terminated. 
This  completes  a  description  of  the  scheme. 

It  remains  to  point  out  that  the  scheme  generates 
a  proper  scheme;  that  no  basic  set  occurs  twice. 

First  consider  (18)  for  t  «  1.  wf  has  just  become 
non-basic.  We  may  assume  that  w_  is  z*  (the  first  com- 
ponent  of  z1).  By  examining  the  first  pivot  it  is  easy  to 
see  that  A^  (the  first  column  of  A*)  has  non-negative 
components.  Let  us  label  as  Eq  the  unbounded  edge  of 
points  of  K’  obtained  for  all  non-negative  values  of  wr» 

Since  1r  -  0  All  points  of  EQ  are  in  CQ;  i.e.,  satisfy 
(16). 

Now  for  any  t  >  1  consider  (18);  where  zQ  remains 
basic.  Let  (w  ,  z  )  be  the  associated  non-basic  complementary 
pair.  A  feasible  pivot  from  (18)  to  a  feasible  form  satis¬ 
fying  (16)  is  only  possible  by  increasing  one  or  the  other 
of  this  pair.  When  (18)  is  not  a  terminal  basic  fora  (either 
t  -  1  or  a  final  t)  there  are  exactly  two  adjacent  basic 
forms  satisfying  (16). 

Therefore  there  can  be  no  first  basic  set  that  repeats, 
for  if  a  basic  set  repeats  the  immediately  preceding  basic  set 
must  have  repeated.  Hence  the  scheme  is  proper. 

Observe  that  the  scheme  may  be  carried  out  for  any 
II  and  q.  Either  the  scheme  terminates  in  a  complementary 
solution  of  L,  or  in  an  unbounded  edge  of  points  of  C^. 
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In  the  latter  case,  let  us  label  that  unbounded  edge  £. 

Let  us  observe  that  £  cannot  be  £Q.  The  only  access 

to  EQ  Is  by  way  of  the  basic  point  of  basic  form  (18)  for 

t  -  1;  and  this  form  does  not  repeat.  If  (an  extreme  case) 

the  sequence  terminated  with  t  =  1,  then  E  must  have  been 

the  set  of  points  obtained  from  (18)  by  increasing  zr.  In 

any  case  £  and  E  are  distinct, 
o 

In  general,  if  the  sequence  terminates  in  E,  no 
conclusions  are  possible.  We  now  however  isolate  a  class 
of  matrices  M  such  that  termination  in  E  implies  that 
K  is  empty,  for  the  given  q. 

Co-positive  matrices  are  (square)  mattices  M  such  that: 

(19)  z  >  0  implies  that  z’Mz  ^  0. 

Co-positive-plus  matrices  are  co-positive  matrices  such  that: 

(20)  z  ^  0  and  z’Mz  *=  0  imply  that  (M+IT)z  *  0. 

The  class  of  co-positive  matrices  include,  most 
importantly,  posit ive-seml-def inite  matrices  (z’Mz  >  0;  all  z), 
and  non-negat lve  matrices  (M  >  0;  that  is,  all  components 
non-negative).  There  are  also  ways  of  compounding  co-positive 
matrices.  For  example,  a  non-negative  linear  combination  of 
co-positive  matrices  is  co-positive. 

The  class  of  co-positive-plus  matrices  also  includes 
posit ive-semi-def inite  matrices,  and  the  class  of  strictly 
co-positive  matrices  (z  >  0,  and  z  t  0  imply  z’Mz  >  0): 

which  includes  posit ive  matrices  (M  >  0).  Again,  co-positive- 
plus  matrices  may  be  compounded  to  form  co-positive  matrices. 
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For  example,  positive  linear  combinations  of  co-positive-plus 
matrices  are  co-positive-plus.  If  M1  and  Mg  are  co-pos- 
itive-plus,  then  so  is  MJ,  and,  since  anti -symmetric  matrices 
(IT  -  -M;  or,  equivalently,  z'Mz  -  0;  all  z)  are  positive- 
semi-definite,  as  observed  by  Cottle  (see  Ref.^J  ),  the 
matrix  fu, ,  -A'l  is  co-positive-plus,  for  any  A. 

K  “2] 

In  particular,  to  ascertain  whether  or  not  M  is 
co-positive-plus  it  is  sufficient  to  test  its  symmetric  part 
£(MtM’);  since  one  is  co-positive-plus  if  and  only  if  the 
other  is.  Thus  a  study  of  the  class  could  be  restricted  to 
the  symmetric  sub-class:  co-positive  symmetric  matrices 
for  which:  z  >  0  and  z'Mz  *  0  imply  that  Mz  «  0. 

Let  denote  the  class  of  co-positive-plus  matrices. 

(The  term  'co-posit ive-pl us ’  and  notation  'C  '  were  suggested 
by  Cottle.) 

Theorem  1.  Let  M  be  in  C+ .  Then  Scheme  I  terminates  either 
in  a  complementary  solution  of  L  or  in  the  conclusion  that 
for  the  given  q  no  feasible  solution  exists. 


Proof:  We  will  suppose  that  the  Scheme  terminated  in  the 

unbounded  set  E  contained  in  C  ,  and  show  that  then  tlere 

o’ 

is  no  feasible  solution.  Let  (18)  designate  the  final  basic 
fora.  We  are  supposing  that,  for  some  r,  all  points  of  L  sat¬ 
isfying  : 

(21)  w*  -  q*  *  z:*A*  for  z*  >  0; 

w**«re  points  satisfying  (16);  where  A*  >  0. 

By  (10),  (11),  (12)  E  may  be  described  as  the  set 


w  ♦  0  7; 


7  *  8  y;  zq  *  ZQ  *  0  uo. 


(22) 
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for  0  >  0;  where  (z,  w)  is  the  basic  feasible  point 

corresponding  to  the  final  basic  form. 

Ve  draw  the  following  conclusions: 
a.  Since  E  is  assumed  to  exist;  both  Case  1 :  u  -  0 
and  Case  2.1  uQ  -  0  cannot  hold.  We  show  that  Case  i 
cannot  bold  and  that  Case  2  must  hold. 

]>.  For  w,  z,  zQ  to  satisfy  (15)  for  all  0  >  0  requires: 
(23)  *v  -  TIoe  +  Jfu. 

£.  For  w,  z,  zQ  to  remain  non-negative  for  all  0  ^  0 


requires : 


u  >  0;  V  >  0;  uQ  ^  0. 


jd.  That  w'z  -  0  hold  for  all  0  >  0  requires  (since 
all  terms  in  (22)  are  non-negative): 


z '  w  -  z'v  -  w  ’  u  -  u’v  -  0. 


If  u  -  0:  (23),  (24)  imply  that  "v  -  ”uoe  0. 

By  (25):  "z’7  -  0  implies  that  "z  -  0.  Hence  w  «  zQe 

described  E.  But  this  is  precisely  the  set  Eq.  Hence 
U  **  0.  For  simplicity  we  normalize  "u  by:  e’lu  -  1. 

We  may  write: 


(26) 


z’w  «  z'Mz  ♦  z’(q  zQe)  ” 


e.  Now  suppose  that  N  is  co-positive.  By  (26): 


(27) 


f(Q)  -  -z’(q  ♦  zQe)  *  z’Mz  >  0; 


where  f(0)  is  quadratic  in  0.  In  order  for  (27)  to 

* 

hold  for  all  0  >  0,  requires  that  the  coefficient  of  0 4 

be  non-negative: 


i 
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(28)  -"uoCu'e)  =  -uQ  >  0; 

which  is  possible  only  if  u  «  0. 

Thus  f(0)  is  linear  in  0,  and  for  (27)  to  hold  for 
all  0^0  requires  that  the  coefficient  of  0  be  non-negative 

(29)  -"u'(q  +  zQ)e  >  0;  or:  u'q  ^  -zo  K  0; 

the  latter  Inequality  since  we  are  assuming  that  zQ  was 
basic  at  the  final  basic  form. 

Summarizing,  if  M  is  co-positive,  the  assumption 
of  E  implies  a  column  "u  satisfying: 

(30)  l(u  ^  0;  "u  >  0;  u'q  ^0;  and  "u'llu  -  C. 

f.  Finally,  assume  that  M  is  co-positive-plus.  Then  II **u 
-  -ifu  ^  0.  Then  for  any  z  >  0  and  w  *  q  +  Mz  we  have: 

(31)  u'w  -  u'q  ^  z'(ITu)  <  0. 

Hence  w  is  not  non-negative;  that  is,  K  is  empty. 

This  completes  the  proof. 

Scheme  II.  For  this  scheme,  we  shall  assume  that  M  has  a 

posit lve  column.  We  work  directly  with  the  Fundamental  Problem. 

We  may  assume  that  the  first  column  of  M  is  positive.  Again 

if  q  >  0,  the  initial  basic  solution  is  a  complementary  solution 

We  assume  that  q  has  a  negative  component.  Then  increasing 

Zj  defines  a  unique  first  pivot  defined  by  the  pivot  pair 

(w  ,  z.);  for  some  r;  leading  to  the  basic  form: 
r  l 

(32)  w1  *  ql  4  llV;  q*  >  0; 

for  t  -  1 .  This  has  a  basic  feasible  solution  satisfying: 
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(33)  w’z  -  (i.e.;  WjZj“  0  for  i  /  1) 

Now  let  be  the  set  of  points  of  K  satisfying  (33). 

(Dantzig  has  called  such  points  'almost  complementary'  points). 

Entirely  analogous  to  Scheme  I,  Scheme  II  pivots 

so  as  to  retain  (33).  This  defines  a  proper  sequence  of 

piyots.  Since  points  obtained  from  (1)  for  z^  large; 

other  Zj  kept  at  zero;  are  points  of  C^,  increasing  wr 

in  (32)  (with  t  *  1)  defines  the  initial  unbounded  edge  Eq 

consisting  of  points  of  C..  Again,  associated  with  each 

whose  basic  solution  is  not  complementary 
basic  form  (32)/for  t  >  is  a  non-basic  complementary 

pair,  say  (w_,  z  );  one  of  which  became  non-basic  on  the 
s  s 

tth  pivot,  and  the  other  of  which  is  to  be  increased  to  define 

a  (t+l)st  pivot  if  possible.  If  the  pivot  is  not  possible, 

another  unbounded  edge  E  of  C ^  is  identified,  and  the 

sequence  of  pivots  terminates. 

Hence,  again,  ths  sequence  terminates  either  in  a 

complementary  solution,  or  in  the  unbounded  edge  E;  distinct 

from  E  . 
o 

The  case  M  >  0  furnishes  an  example  of  a  class 
for  which  both  Schemes  I  and  II  terminate  in  a  complementary 
solution  for  any  q,  since  M  is  in  C+,  and  since  there 
is  no  question  of  feasibility.  The  proof  that  Scheme  II 
works  is  extremely  simple: 

Theorem  2.  If  M  >  0,  Scheme  II  terminates  in  a  complementary 
solution  for  any  q. 

Pfoof :  We  point  out  that  the  set  C1  contains  only  the 
unbounded  edge  Eq  and  no  other.  Hence  that  the  pivots  cannot 
terminate  in  an  unbounded  edge.  Indeed,  if 
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(34)  w  =  w  +  Q  v;  z  =  z  +  S  u;  0  >  Q 

are  points  satisfying  (33),  and  are  feasible,  so  that: 

~  =  rfu;  u,  v  >  0;  and  u  /  0:  then  v  >  0,  since  M>  0. 

For  all  i  /  1,  by  (33):  0  =  w^^  (z ^  +  0  TL);  and  since 

w,  *=  w.  +  0  v .  >  0  for  0  0,  we  have:  z.  =  TT  =  0, 

lxl  11 

Hence,  on  the  assumed  unbounded  edge  (34)  only  z,  is 
different  from  zero;  hence  the  edge  is  E^.  Hence,  Scheme 
II  terminates  in  a  complementary  solution. 


A  slightly  different  situation  arises  in  the  case  of 
a  bi-matrix  game  example,  defined  as  an  M  of  the  form: 


=  1°'  A)  ‘ 

l  B,  0/  * 


where  A  >  0  ;  B>  0 . 


In  this  case,  M  is  co-positive,  but  is  not  in  C+f  since 
for  z  -  |q]J  ;  where  z1  /  0;  it  is  true  that  z'Mz  =  0,  but 

-((bSWJ'  °- 

Indeed,  for  q  o.f  the  form:  q  =  el  J  ;  where  the  e. 

\  e2l 

are  positive  columns,  (1)  may  be  written  as  two  formats: 

Az2  -  ex  =  w1?  Bz}  +  e2  -  Wg. 

Then  L  is  always  feasible;  but  requires  that  w2  ^  0  and 
z2  /  0,  so  that  w2z2  =  ®  cannot  be  satisfied.  Hence  Scheme  I 
cannot  terminate  in  a  complementary  solution  in  this  case. 

However,  the  case  where  q  =  -e  (e  >  0),  of  applica¬ 
tion  to  bimatrix  games,  is  an  eaample  of  a  pair  M,  q  for 
which  a  complementary  solution  always  exists.  A  first  con¬ 
structive  proof  of  this  was  given  in  Ref. [^4  J,  where  the 
type  of  pivotal  scheme  consider  in  this  section  was  first  de¬ 
viled.  The  scheme  is  almost  the  same  as  Scheme  II.  It  re¬ 
quires,  as  a  major  difference,  two  initial  pivots  to  achieve 
feasibility. 
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We  shall  first  describe  the  Scheme,  and  then  prove 
the  associated  theorem. 

Scheme  III.  In  the  case  in  point,  L  may  be  expressed  in 
the  disjoint  forms: 

I:  u  =  -e  +  Ay ;  u  >  0 ; 

(36) 

II:  v*=-e+Bx;  v  >  0; 

The  complementary  solution  condition  is: 

(37)  u'x  =  0;  and  v'y  =  0; 


y  >  0 
x  >  0 


thus;  (u,,  x.)  are  complementary  pairs;  and  (v.,  y.)  are. 
Although  the  two  positive  e's  have  different  order,  no 
confusion  need  arise.  Scheme  III  consists  of  the  following 
sequence: 

first  pivot :  In  I,  increase  yj  to  obtain  I-f easibil ity ; 
that  is,  pivot  on  the  pivot  pair  (ur,  y ^ ) ;  r  automatically 
defined. 

second  pivot :  having  determined  r  on  the  first  pivot; 
in  II  increase  xr  (complement  of  u^)  to  obtain  initiol 
II-feasibility .  Let  (v  x  )  be  the  pivot  pair. 

3  i 

Then,  if  s  =  1  the  resulting  basic  points  satisfy 
(37),  and  the  pivoting  terminates.  If  not,  they  continue. 

Let  Cj  denote  the  set  of  points  of  K  satisfying: 


(38) 


u'x  +  v'y  =  v.y 


1*1* 


1  ■  (!)  <' 


satisfies  (38) 


In  any  case,  the  point  z  =  l^|  where  y  and  x 
are  the  basic  points  of  I  and  II  respectively  obtained  by 
the  first  two  pivots, 

The  scheme  consists  of  alternating  pivots  on  I  and 
II;  always  increasing  the  complement  of  the  variable  which, 
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on  the  immediately-previous  pivot  became  non-basic.  By 
this  scheme  pivoting  is  automatic,  and  all  basic  points 
z  encountered  are  in  C^. 


Theorem  III  Scheme  III  terminates  in  a  complementary  solution. 
Proof ;  As  in  Theorem  2,  we  may  show  that  an  initial  unbounded 
edge  contained  in  Cj  is  the  only  unbounded  edge  contained 

in  Cj . 

Let  Kj  denote  the  set  of  Infeasible  points,  and 
similarly  Kjj  _for  II.  Associated  with  each  fcasic  point 
w  *  j— j  ;  z  =  |— |  of  K  are  the  basic  points  Cu,  "y)  of 
Kj  and  (v,  “x)  of  Ku»  and  during  a  pivot  only  one  of  these 
latter  points  is  changed.  In  particular,  points  on  an  unbounded 
edge  of  K  must  be  either  of  the  form:  (w,  z)  -  )  |xjj; 

where  Cv,  "x)  is  a  basic  point  of  Kjj,  and  (u,  y)  lies 
on  an  unbounded  edge  of  K^;  or  else  of  the  form 
(w,  z)  =  ,  j*jj;  where  7)  is  a  basic  point  of  Kj ; 

and  (v,  x)  lies  on  an  unbounded  edge  of  Kjj.  (This  is  true,, 
since  points  on  an  unbounded  edge  of  K  have  n  or  n-1 
zero  components.) 

The  unbounded  edge  E  is  identified  as  the  set  of 

o 


points  z  *  j  j  ;  obtained  from  the  first  basic  point  by 
allowing  xr  (complement  of  ur)  to  go  to  infinity. 

Now,  let  E  be  any  unbounded  edge  assumed  to  be  in  Cj. 
We  show  (Case  i)  that  E  cannot  involve  an  unbounded  edge  of 
Kj  and  (Case  ii)  if  E  involves  an  unbounded  edge  of  Kjj, 
then  E  is  Eq. 

For  Case  i,  E  is  a  set  of  points  of  the  form: 

(39)  y  =  y  +  0  y;  u  *u  4  0  u;  x  »  x;  v  7;  and  y  /  0. 

For  (38)  to  hold  for  all  0  >  0  requires  that: 
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.  A  A  A  ^  A 

(40)  u  -  Ay;  u  _  0;  y  _  0;  and  u'x  *  0. 

But  y  /  0  and  A  >  0  imply  that  u  >0.  Then  u'x  =  0 
implies  that  x  -  0;  which  is  impossible,  since  x  is 
Unfeasible. 


In  Case 

ii,  E  is  the 

set  of 

points  of  the  form: 

(41) 

y  -  y; 

u  -  u;  x  *=  x 

+  0  x; 

—  A  A 

v  -  v  +  Q  v;  x  r  0. 

And 

that  (38) 

hold  for  all  0 

>  0 

requires : 

(42) 

<> 

i 

DO 

X  > 

<> 

1 

o 

x> 

1 

o 

and 

A  „  A  ^  _ 

v  'y  -  Vlyi;  u'x  -  0. 

But 

x  /  0  and 

B>0  imply  that  v 

=  Bx  >  0.  Then  v^y^  =  i 

for 

i  /  l  implies  that  "y  - 

-rr*5 

fk  =  0;  that  is 

t 

jr^  *  0  for  i  /  1.  That 

is, 

y  has  only  one  positive 

component;  namely  Also,  since  (u,  7)  is  a  basic  point 

of  II,  and  y  has  only  the  positive  component  ,  all 
components  of  u  except  one  must  be  positive.  But,  by  the 
first  pivot,  this  component  must  be  ur:  ur  *  0.  Then  the 

_  A  _  A 

condition  u'x  *  0  requires  that  xi  *  x^  »  0;  for  i  ?  r; 
that  is,  only  xf  is  positive  in  (41).  This  identifies 
the  unbounded  edge  E  as  Eq,  and  completes  the  proof. 

We  shall  terminate  this  section  with  an  observation 
and  some  remarks. 

Referring  to  the  Fundamental  Problem  (1),  (2),  (3), 
consider  the  set  of  all  points  of  K  satisfying: 

(43)  w’z  -  w1z1  (that  is,  w^z^  -  0  for  j  /  i). 

CA  may  be  empty  of  course.  Assuming  that  L  is  non-degenerate, 
if  G  z)  1*  in  (43)  requires  that  the  point  have  at 
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least  n-1  zeros;  hence  by  non-degenera' y  either  n  or 
n-1  zeros.  In  the  former  case  the  point  is  casic  feasible; 
in  the  lattfer  case  it  lies  on  an  edge  of  K.  In  the  former 
case,  the  extreme  point  is  eith erja.  complementary  solution  of 
is  not.  If  it  is  a  complementary  solution  then,  in  the 
corresponding  basic  form  just  one  of  the  pair  (wit  Zj)  is 
non-basic,  and  only  that  non-basic  variable  may  be  increased 
to  remain  in  C^.  If  the  extreme  point  is  not  a  complementary 
solution,  then  both  members  of  the  complementary  pair  (w^,  z^) 
are  basic  in  the  corresponding  basic  form;  and  therefore,  to 
satisfy  (43)  the  other  n-2  basic  variables  must  have  their 
complements  non-basic.  Therefore,  the  other  two  non-basic 
variables  must  be  a  nou-bat;ie  pair  (w  ,  z  )  say;  and  making 
just  one  of  these  positive  is  the  only  way  to  move  from  the 
extreme-point;  that  is,  for  a  non-complementary  basic  point 
of  Ci  exactly  two  edges  contained  in  meet  in  the  point; 

and  moving  along  one  of  these  edges  leads  either  to  another 
extreme  point  ir.  or  along  an  unbounded  edge  of  points  oi 

Ci* 

We  therefore  conclude  that  t\  is  the  union  of  a 
finite  number  of  disjoint  adjacent-extreme-point  paths  (which 
are  ’proper’;  that  is,  10  extreme  point  on  the  path  has  more 
than  two  edges  joining  it;  and  has  one  edge  if  and  only  if 
it  is  a  complementary  point).  And  C  (the  set  of  complementary 
points)  is  precisely  the  set  of  end-points  of  the  paths. 

It  is  also  cleat  that  if  i  ^  i’,  a  point  ii  both 
and  Ci ,  must  be  a  complementary  solution.  Hence  C 
is  the  intersect  oi  the  sets  ;  1  =■  1,  2,  ...,  n. 
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We  have  used  these  'almost  complementary'  sets  in 
the  preceding  schemes. 

An  enumeration  of  the  possible  kinds  of  paths  in  a 
set  CA  brings  out  the  fact  that  the  path  may  contain  (i) 
no  complementary  solution  (in  the  case  where  the  path  has 
no  end-points;  that  is,  either  a  closed  path  or  one  which 
contains  two  unbounded  edges) ;  (ii)  two  complementary  solu¬ 
tions  (when  the  path  has  two  end-points;  i.e.,  is  not  closed 
and  has  no  unbounded  edges);  and  (iii)  one  complementary 
solution  (one  end-point  and  one  unbounded  edge).  In  the 
previous  schemes  favorable  paths  were  of  the  form  (iii). 

Regarding  such  schemes,  postulating  for  example 
that,  for  a  given  M  and  q,  for  some  i  has  only 

one  unbounded  edge  of  K  ensures  xhe  existence  of  a 
complementary  solution  (for  example,  the  case  M  >  0, 
and  Scheme  II). 

Also,  for  the  case  of  an  M  and  q  for  which  there 
is  a  unique  complementary  solution,  since  precisely  n 
edges  of  K  meet  in  this  point,  exactly  one  of  the  n 
edges  is  in  for  each  i,  and  the  paths  of  con¬ 

taining  that  edge  must  have  (at  the  other  end  of  the  path) 
an  unbounded  edge.  Therefore  one  may  conclude  that  such 
a  K  must  contain  at  least  n  unbounded  edges.  An  example 
is  given  by: 

Theorem  4:  If  M  is  posit ive-semi-def inite,  and  L  is 
non-degenerate  and  K  is  non-empty,  then  there  is  exactly 
one  complementary  solution. 

Proof :  If  (w,  "z)  and  (w*,  z+)  are  complementary  solutions, 

so  that  w 'z  «  w*’z*  -  0;  then  w*-w  -  U(z*-z)  and: 
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-(w*z  ♦  w'z*)  =  (Z*-z)  •  (w*-w)  =  (¥*-zVM(z*-'z)  ^  0; 

which  implies  that  w*,-z  *  w'^*  =  0;  which  implies  that 

*  __  _  _  _  $ 

Cw*+wT’ Cz*+z)  =  0;  hence  (w*+w,  z*+s!)  have  at  least  n 

zeros;  hence  at  most  n  zeros,  since  (e^  (w,  z)  is  basic. 
We  must  conclude  that  (w,  z)  and  Cw*,  1*)  have  the  same 
positive  components;  that  is,  the  same  basic  set.  Hence, 
since  a  basic  set  has  just  one  basic  point  associated  with 
it,  (w,  z)  =  (w*,  z*) . 
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C.  CONCLUDING  REMARKS. 

The  use  of  'Gauss- Jordan  pivots'  has  a  long  history; 
certainly  in  the  solution  of  systems  of  linear  equations, 
and  more  recently  in  Mathematical  Programming,  starting  fit h 
the  simplex  method  of  G.  B.  Dantzig.  The  format  (1),  (2)  is 
essentially  the  format  for  the  simplex  method.  It  has  been 
exploited  extensively  by  A.  V.  Tucker  (see  Ref.fio]  ),  and 
others  since. 

The  Fundamental  Problem  probably  assumed  importance 
as  a  form  for  solving  the  convex  quadratic  programming  prob¬ 
lem.  Essentially  all  methods  proposed  to  date  for  this  prob¬ 
lem  may  be  approached  from  this  form.  The  extension  of  the 
use  of  the  form  of  the  problem  and  the  applicability  of  simi¬ 
lar  schemes  to  bimatrix  games  (a  'non-convex'  example)  lent 
more  importance  to  the  Fundamental  Problem.  Examples  from 
sources  other  than  Programming  and  Game  Theory  are  noted  by 
Cottle  and  Dantzig  (Ref .[2]  ). 

By  way  of  extending  the  class  of  problems  which 
have  a  complementary  solution,  besides  the  classes  mentioned 
In  these  notes  the  classes  of  'adequate*  matrices  considered 
by  Singleton  (Ref.  f 8]  ),  and  matrices  with  'positive  prin¬ 
cipal  minors'  (p.p.m.)  considered  by  Cottle  and  Dantzig,  and 
by  A.  W.  Tucker  (see  Ref.[lo])  are  worth  mentioning. 

It  may  be  noted  that  Theorems  1-3  in  these  notes 
rely  on  the  particular  schemes  l-III;  which,  although  reason¬ 
ably  'natural'  do  not  preclude  the  discovery  and  development 
of  equally  valuable  pivotal  schemes,  applicable  to  these  or 
other  classes  of  problems.  For  example,  extending  the  class 
for  which  Scheme  I  is  appropriate,  1  -sons  (Ref. fa]  )  has 
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observed  that  matrices  which  are  'principal  pivotal  transforms' 
(see  Ref. [9]  )  of  co-positive-plus  matrices  could  also  be 
handled  by  Scheme  I.  As  another  example,  the  algorithm 
of  Cottle  and  Dantzig  (see  Ref.flj),  predating  the  devel¬ 
opment  of  the  Scheme  I,  exploiting  principal  pivot  transforms, 
may  be  used  on  the  class  of  p.p.m.  matrices.  Persons  (Ref. 

[6]  )  has  developed  a  principal  pivoting  scheme  for  the 

convex  quadratic  programming  problem,  and  has  also  pointed 
out  that,  for  this  problem,  (more  generally  for  positive- 
semi-definite  M)  Scheme  I  may  be  described  inAerms  of 
principal  pivots.  These  examples  are  certainly  not  ex¬ 
haust  ive. 

Scarf  (see  References),  in  a  series  of  Cowles  Commis¬ 
sion  papers, has  extended  the  use  of  the  'almost  complementary’ 
paths  with  potentially  valuable  results. 
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APPENDICES 

Appendix  1.  The  Convex  Quadratic  Programming  Problem. 

A  usual  statement  of  this  problem  is: 

(44)  Minimize  c*x  +  $x’Qx;  where  Ax  >  b;  x  >  0. 

A  general  form  of  the  linear  programming  problem  is  obtained 
by  taking  Q  -  0.  The  functional  is  convex;  which  is  the 
case  if  and  only  if  Q  is  a  positive-semi-definite  matrix 
(which  may  be  considered  symmetric).  The  general  'Kuhn- 
Tucker'  conditions  (see  Ref. [3]  )  in  order  that  an  x 
solve  (44)  become  necessary  and  sufficient  conditions. 

These  conditions,  when  written  concisely^  become:  find 
x,  y,  u,  and  v  satisfying: 


which  is  an  example  of  the  Fundamental  Problem.  In  this 
example:  /  0,  a\  ;  a  posit  ive-semi. -definite  matrix; 


hence  co-posit ive-pl us.  Therefore,  Theorem  1  gives  a 
constructive  proof  of  the  existence  of  a  solution  to  (45), 

(and  of  the  duality  theorem  of  linear  programming). 

Appendix  2.  Equilibrium  points  of  Bimatrix  Games. 

Mixed  strategies  are  columns  x  and  y  satisfying: 
e’x  -  1;  x  >  0  and  e’y  «  1;  y  >  0;  where  *e'  denotes 
a  column  of  appropriate  order  with  components  all  1.  A 
Mash  equilibrium  point,  for  given  r  by  s  matrices  A 
and  B  is  defined  (see,  for  example.  Ref  [4j  )  as  a  pair 
(x,  J)  of  mixed  strategies  such  that  for  all  mixed  strategies 
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(X,  y): 

x’Ay 

^  x’Ay 

(46) 

x’Efy 

<  x’By 

x'Ay  is  here  interpreted  as  the  ’long-range  loss  per  play 
to  Player  I'  if  he  pla^s  each  play  according  to  probabilities 
x  and  Player  II  plays  according  to  probabilities  y. 

Let  e^^  be  a  column  with  jth  component  1;  and 
other  components  zero.  The  e^  are  mixed  strategies  called 
pure  strategies.  It  is  readily  seen  that  (46)  holds  if 
and  only  if  it  holds  for  all  pure  strategies:  (x,  y)  =  (e,,  e  ). 

**  si 

In  vector  form  the  equivalent  condition  becomes: 

(x '  Ay )  e  ^  Ay 

(47)  ^  ; 

(x’By)e  <  B’x 

where  e  has  all  l’s.  If  E  -  ee’  is  a  matrix  of  all  l's, 

1  «(x’e)(e'y)  -  x'Ey.  Therefore,  adding  any  multiple  of 
E  to  A  and  B  in  (46)  does  not  change  the  set  of 
equilibrium  points.  We  may  therefore  assume  that  A  >  0;  B>  0. 
Hence  x’Ay  and  x’By  are  posit ive  for  mixed  strategies  (x,y). 
Consider  constraints  of  the  form: 


0le 

< 

Ay 

x’(Ay  -  0, e)  -  0 

(48) 

and 

V 

< 

B’x 

7’(B’x  -  02e)  -  0. 

Since  for  mixed  strategies  fx,  7),  the  ’complem* n .ary ’ 
constraints  on  the  right  are  simply:  0j  -*  "x’Ay;  02  "  * 
the  above  system  of  constraints  is  equivalent  to  (47). 
Next,  defining  variables  x  -  x/02;  y  *  <nd  intro¬ 

ducing  slacks,  (48)  becomes: 
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vj  \-o 


~e\  , 


0,  A  \  / 


B’  0/1  y)  v/ 


'■(%(X)>0;  (*V(-Lo, 

I  \  yj  \y  I  l  v  / 


a  problem  in  the  ’fundamental  form*  where,  in  this  case: 

jo,  A 

M  -  ^s,  qJ  .  Conversely,  since  for  x  and  y  to  yield 

feasible  u  and  v  requires  that  x  a  0;  y  ?  0,  if  (x,y) 
is  a  solution  to  (49),  "x  ~  x/x'e;  "y  •>=  y/c’y  yields 
a  solution  to  (48),  so  that  (48)  and  (49)  are  equiva¬ 
lent  . 


Appendix  3.  In  Section  A  we  have  remarked  that  a  ’pivotal 
scheme'  is  completely  determined  when  the  criteria  for  select 
the  pivot  pair  and  for  terminating  are  given.  The  following 
is  an  example  of  pedagogical  value  in  developing  the  basic 
elements  of  linear  algebra,  not  only  in  isolating  all  compu¬ 
tations  as  'pivot  schemes',  but  basing  the  usual  theorems 
(relating  to  independence,  rank,  etc.)  thereon.  (In  this 
regard,  see  also  the  remarks  of  A.  W.  Tucker,  Ref.JlOj). 

There  is  nothing  new  in  the  computations:  nor  indeed  in  the 
idea,  but  perhaps  in  the  presentation. 

Consider  finding  all  solutions  to  a  linear  system: 


(50)  0  =  q  +  Az. 

Ultimately,  this  means  'solve  (50)  for  some  of  the  variables 
in  terms  cx  the  remaining  ones".  Introduce  'pseudo'  variables 
w  and  consider  pivots  on: 


(51)  w  =  q  +  Az. 

Pivot  according  to  the  following  scheme:  having  obtained 
a  basic  form,  examine  the  current  non- basic  set.  If  it  contains 
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an  original  z.,  say  z.  =  z^,  examine  the  column  coefficient 
J  J  s 

A*.  If  At  „  /  0  for  some  r  such  that  w*"  is  a  pseudo 
s  r,  s  r 

variable,  perform  the  pivot  determined  by  the  pivot  pair 

f  f  t"  f 

(w  ,  z  ).  When  no  such  z  or  no  such  A  can  be  found 

r’  s  r  r,s 

terminate.  The  iterations  terminate  in  R  pivots,  where 
P  *  Rank  A.  The  final  basic  form  gives  all  solutions  to  (50) 
by  setting  all  pseudo  variables  equal  to  0. 
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ON  THE  OPTIMALITY  OF  (a,  S)  INVENTORY  POLICIES: 
NEW  CONDITIONS  AND  A  NEW  PROOF* 


ARTHUR  F.  VEINOTT,  JR.f 

Abstract.  Scarf  (61  has  shown  that  the  (s,  S)  policy  is  optimal  for  a  class  of  discrete 
review  dynamic  nonstationary  inventory  models.  In  this  paper  a  new  proof  of  this 
result  is  found  under  new  conditions  which  do  not  imply  and  are  not  implied  by 
.‘-''■arf's  hypotheses.  We  replace  Scarf’s  hypothesis  that  the  one  period  expected  coeta 
ai  convex  by  the  weaker  assumption  that  the  negatives  of  these  expected  costs  are 
unii.,odal.  On  the  other  hand  we  impose  the  additional  assumption  not  made  by  Scarf 
that  the  absolute  minima  of  the  one  period  expected  costa  are  (nearly)  rising  over 
time.  For  the  infinite  period  stationary  model,  this  last  hypothesis  is  automatically 
satisfied.  Thus  in  this  case  our  hypotheses  are  weaker  than  Scarf’s.  The  bounds  on 
the  optimal  parameter  values  given  by  Veinott  and  Wagner  [12]  are  established  for 
the  present  case.  The  bounds  in  a  period  are  easily  computed,  and  depend  only  upon 
the  expected  costs  for  that  period.  Moreover,  simple  conditions  are  given  which 
ensure  that  the  optimal  parameter  values  in  a  given  period  equal  their  lower  bounds. 
When  there  is  no  fixed  charge  for  ordering,  this  reduces  to  earlier  results  of  Karlin 
[5]  and  Veinott  [9],  [10],  (11)  for  the  nonstationary  case.  The  above  result  is  exploited 
to  extend  the  planning  horizon  theorem  of  Veinott  [9]  to  the  case  where  there  is  a 
fixed  charge  for  ordering. 

1.  Model  formulation.  We  consider  a  single  product  dynamic  inventory 
model  in  which  the  demands  A  ,  A ,  •  •  •  ,  for  a  single  product  in  periods 
1,2,  •  ,  are  independent  random  variables  with  distributions^  ,*,,•••. 

Assume  ( tj,|  are  given  constants  such  that1  A  £  Vi  for  all  ».  At  the  beginning 
of  each  period  the  system  is  reviewed.  An  order  may  be  placed  for  any 
nonnegative  quantity  of  stock.  An  order  placed  at  the  beginning  of  period  i 
is  delivered  at  the  beginning  of  period  i  +  X,  where  X  is  a  known  non¬ 
negative  integer. 

Let  x ,  denote  the  stock  on  hand  and  on  order  prior  to  placing  any  order 
in  period  i.  Let  y,  denote  the  stock  on  hand  and  on  x>rder  after  ordering  in 
period  ».  It  is  possible  for  x,  and  y,  to  be  negative  indicating  the  existence 
of  a  backlog.  We  assume  that  the  amount  of  stock  on  hand  and  on  order 
at  the  end  of  period  t  is  a  specified  Borel  function  v,(y, ,  A)  of  y,  and  A  . 

*  Received  by  the  editors  July  26,  1965,  and  in  revised  form  December  27,  1966. 

t  Program  in  Operations  Research,  Stanford  University,  Stanford,  California. 
This  research  was  supported  by  the  Office  of  Naval  Research  under  Contract  Nonr- 
226(77)  and  by  a  grant  from  the  Western  Management  Science  Institute. 

1  Actually  the  main  results  of  the  paper  given  in  |2  also  hold  in  the  more  general 
case  where  D ,  is  a  random  vector.  All  that  is  required  is  to  let  IX  be  the  Borel  set  of 
possible  values  of  D ,  and  replace  the  interval  [%  ,  • )  of  possible  values  of  A  every¬ 
where  by  X>,  .  This  more  general  formulation  allows  for  consideration  of  several 
classes  of  demands,  random  deterioration  rates,  random  departures  of  backlogged 
demand,  and  random  prices,  for  example,  by  suitable  interpretation  of  the  compo¬ 
nents  of  D ,  . 
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Thus  Xi+i  —  Viiy, ,  D,).  If  X  >  0  we  assume  that,  all  unsatisfied  demand  is 
backlogged  so  v,(y> ,  D,)  =  ?/,  -  D, . 

When  X  =  0  our  formulation  provides  for  the  possibilities  of  deterioration 
of  stock  in  storage  (perishable  goods)  and  partial  backlogging  of  unsatisfied 
demand  [11,  p.  766J.  For  example  suppose  that  whenever  y,  <  Dx ,  then  a 
fraction  b  (0  ^  b  S»  1)  of  the  unsatisfied  demand  is  backlogged  and  the 
remainder  leaves  immediately.  If  instead  y,  >  D, ,  then  a  fraction  1  —  a 
(0  o  ^  1 )  of  the  inventory  on  hand  sjxjils  and  is  not  available  for  future 
use.  These  assumptions  imply  that  •)  takes  the  form 


VtiViyDi) 


\a-(y,  -  Z>.) 
\b-(y<  -  D,) 


if  y .  ^  Di , 
if  y,  ^  D, . 


Note  that  if  o  =  0  we  have  the  case  of  perishable  goods  while  if  a  —  1  we 
have  the  case  of  nonperishable  goods.  If  b  =  0  we  have  the  lost  sales  case 
while  if  b  =  1  we  have  the  backlog  case.  In  the  literature  these  last  two 
cases  are  usually  discussed  only  where  a  =  1. 

At  the  beginning  of  period  i,  the  inventory  manager  is  assumed  to  have 
observed  the  vector 


JJ i  ~~  ,  •  •  •  >  x« ,  yi ,  •  •  •  ,  j/i— i ,  D\  y  •  •  •  ,  Dt— i), 

representing  the  history  of  the  process  up  to  the  beginning  of  period  i.  He 
bases  his  ordering  decision  in  period  i  upon  H, . 

An  ordering  policy  for  period  i  is  a  real  valued  Borel  function  F,(  • )  to 
be  used  as  follows.  At  the  beginning  of  period  i,  after  having  observed  the 
past  history  Ht ,  the  manager  orders  —  x,  which  is  assumed  to  be 

nonnegative  of  course.  Also  let  7 ,  =  (?,,•••,  P„)  denote  a  sequence  of 
ordering  policies  for  periods  *,•••,  n. 

Three  types  of  costs  are  considered:  ordering,  holding,  and  shortage. 
Assume  that  the  cost  of  ordering  z  units  in  period  i  is  Kt6(z)  +  c,z,  where 
Ki  £  0,  J(0)  *  0,  and  6(z)  *  1  for  z  >  0.  The  cost  is  incurred  at  the  time 
of  delivery  of  the  order.  Let  g,(y,  D,^K)  denote  the  holding  and  shortage 
cost  in  period  i  +  X  when  y  is  the  amount  of  stock  actually  on  hand  after 
receipt  of  orders  to  be  delivered  before  the  end  of  period  i  +  X.  We  assume 
that  0i(-,  • )  is  a  real  valued  Borel  function. 

Let  «,( £0)  be  the  discount  factor  for  period  i  +  X.  That  is,  a,  is  the 
value  at  the  beginning  of  period  t  -f  X  of  one  cost  unit  at  the  beginning 
of  period  t  +  X  +  1.  Let  &  -  1  and  0,  -  «/  f°r  *  >  L 

For  the  case  X  ■*  0  let 

w,(y,  t)  «=  c,y  4-  g,(y,  t)  -  t). 
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For  the  case  X  >  0  let 

W,(y,  t)  -  Ciy  4-  f  g,(y  -  z,  t)  d$,x(z)  -  aiC^iy  -  E{D,)), 

«L  oo 

where  4>,x(  ■ )  is  the  distribution  of  Z),  +  ■  •  •  4-  Z),+a_i . 

Now  for  X  S  0  let 

Gtiy)  -  £V(y,t) 

We  assume  that  all  integrals  given  above  exist  and  are  finite. 

We  suppose  that  each  unit  of  stock  left  over  after  X  4-  »  periods  can  be 
discarded  with  a  return  of  c„+i .  Similarly,  each  unit  of  backlogged  demand 
remaining  after  X  +  n  periods  is  satisfied  at  a  cost  c,+i .  In  the  literature  it 
lias  often  been  assumed  that  <v,i  =  0. 

Thus,  the  expected  discounted  cost  incurred  in  periods  X  +  1,  •  •  ■  ,  X  n 
when  following  the  policy  Pi  in  periods  1,  •  •  •  ,  n  is 

£  jZ  -  Xi)  +  Ciiyi  -  Xi)  +  ffi  (yx  -  E  /)y, 

“  0»+l  C»+l  ^n+l  ~  ^  ^  • 

By  substituting  x,  =>  ,  D,^)  into  the  above  formula  we  get  as  in 

[10],  [12], 

•  r  *+x  "i 

Z  E[Ki  4(y,  —  Xi)  4-  (?,(#,)]  —  Ci  ii  —  0»+i  c«+i  Z  E{Di)  I . 

i-l  L  *•■+»  J 

Since  the  second  bracketed  term  is  not  affected  by  the  choice  of  Pi  ,  it  is 
convenient  to  omit  it  from  the  analysis.  Thus  we  may  define  the  conditional 
expected  discounted  cost  incurred  in  periods  X  +  *,•••,  X  4-  «  when 
following  P,  in  periods  i,  •  •  •  ,  n  given  the  observed  history  H,  as 


(1)  M?i  j  H.)  -  £  diE.MCMy,  -  x^  +  QAVi)). 

We  seek  a  policy  Pt*  »  ( P|*,  •  •  • ,  P„*),  called  optimal,  which  satisfies 


(2) 


S/.(P,| Hi), 


»  -  t, 


for  all  Hi  and  P, ,  where  of  oourse  f*  -  ( ¥*,  •  •  •  ,  P,*).  It  is  easy  to 
show  by  induction  on  i  (starting  with  i  -  n)  that  if  there  is  an  optimal 
policy,  then  /,(P,*  |  Hi)  depends  upon  Hx  only  through  x, ,  so  we  may 
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write 

(3)  «/,(x.),  i-1,  -,n, 

where  the  /<  satisfy  (/«+ i(x)  *  0) 

(4)  /i(z)  -  inf,t,  |A,4(y  -  x)  4-  <7,(y)  4-  «>Ef,-n(v,(y,  £>,))} 

for  *  -  1,  •  •  •  ,  n  and  all  x  with  the  infimum  being  attained  for  each  x. 
Conversely  if  there  is  a  sequence  of  functione  |/,|  which  satisfy  (4),  with 
the  infimum  being  attained  for  each  x,  then  there  exists  an  optimal  policy 
T*.  Moreover,  is  any  value  of  y  which  minimises  the  expression 

in  braces  on  the  right  side  of  (4)  subject  to  y  £  x  where  x  *  x, .  Since  the 
minimising  value  of  y  is  a  function  only  of  x, ,  it  follows  that 
m  depends  upon  //,  only  through  x, .  For  Rotational  convenience, 

we  define 


(5)  Ji(y)  -  G,(y)  4-  atEfi+i(vt(y,  D,)). 

In  what  follows  we  shall  have  occasion  to  impose  one  or  more  of  the 
following  assumptions  for  each  i  (Km+ 1  ■  0): 

(>)  Gi(y)  and  v,(y,  t)  are  continuous  in  y  for  each  /  £  ; 

(ii)  liny~  Gt(y )  >  inf*  G,(y)  4-  aJt.+i  ; 

(iii)  llm^—  (7,(y)  >  inf,  G,(y)  4-  Kt ; 

(iv)  -Gi(y)  is  unimodal  in  y; 

(v)  ®.(y,  0  is  nondecreasing  in  y  for  each  t  £  v.  ;  moreover,  v,(y,  t)  is 
bounded  above  in  t  on  (*, ,  * )  for  each  fixed  y; 

(vi)  Ki  2  aJCi+i . 

If  (i)-(iii)  hold,  there  are  a  number  $.  which  minimises  G,(y)  on 
( —  « ,  « )  and  numbers  j,(  $  §,)  and  S,(  ii  $,)  such  that 

0.(5.)  *  (?,(&)  4-  a*K,M 

and 

G.(t.)  -  <?.($,)  4-  K,. 


If  in  addition  (vi)  holds,  there  is  a  number  I, ,  f,  £  J,  £  ,  such  that 


0.(1.)  -  <?.(&)  4  (A.  -  a.A,„). 

S.  The  optimality  of  the  (a,  S)  policy.  In  this  section  we  shall  show  that 
that  if  (i)-(vi)  hold  and  if 


(vii)  ®,(§, ,  I)  £  for  l  £  *  and  »  *  1,  2,  •  ,  n  -  1, 

then  there  is  an  optima)  policy  which  is  an  (#,  S)  policy.  By  this  we  mean 
that  there  is  a  sequence  { (s, ,  «S,)|  of  pairs  of  numbers  such  that  (*<  £  S,), 


S,  if  x,  <  t, , 

x,  if  X,  £  *, , 


INVENTORY  POLICIES 


1071 


6,(y) 

N 

f  K,  -  aiK,*i 

\  s* 

1  T" 

I  i- 

|  Kj-a,Kl+i 

T"~ 

—  rVi  ! 

i  I  iii  l  „ 

li  *t  5i  ®i  y 

Fio.  1 

for  all  i  and  H, , 

Moreover  the  numbers  satisfy 

(6) 

5,  2  a.  2  «.  2  &  2  5,  £  S. 

and 

(7) 

(?.(«,)  £  G,(S, )  +  (K.  -  «JC.+l) 

for  all  i.  The  bounds  in  (6)  and  (7)  are  depicted  in  Fig.  1.  The  inequality 
(7)  has  the  interpretation  that  if  it  is  optimal  to  order  in  period  t  with 
the  initial  inventor>-  level  jr,  then  the  reduction  in  the  immediate  expected 
costs  due  to  G,(  • )  must  be  at  least  K,  —  aj(,+ , .  In  the  special  case  where 
g,  »  0  and  c,(y,  t)  -  y  —  t  for  all t,  then  (vii)  reduces  to  the  simpler  form: 
2  §,♦«,*  -  1,  •  ,n  -  1. 

The  first  proof  that  the  («,  S)  policy  is  optimal  under  reasonably  general 
conditions  is  due  to  Scarf  {6],  (See  also  (13).)  Scarf  assumes  that  G,(y)  is 
convex  in  y  with  G,(y)  -*  *  as  j  y  }  -*  * ,  that  e,(y,  1}  -  y  —  t,  and  that 
(vi)  holds.1  Three  assumptions  imply  (i)-(vi)  and  are  in  fact  quite  a  bit 
strongr*.  To  elaborate  on  this  point  we  remark  that  if  H\(y,  l)  is  convex 
in  y .  then  <7,(y)  is  convex  in  y  for  any  distribution  ♦,+* .  However,  if 
has  a  density  with  -  i)  having  a  monotone  likelihood  ratio 

with  respect  to  tf,  then  ~G,(y)  will  be  uni  modal  under  conditions  where 
IF,(y.  I)  is  not  convex  in  y.  Specifically,  -GAy)  ia  unimodal  if  W\(g,  /) 
"  -  I)  +  1F,*(0  for  some  functions  W'  and  W'  where  —  WVfa) 

»  unimodal  in  t,  (3).  See  {4}  for  a  discussion  of  the  utility  of  this  assumption. 

Although  Scarf  impose*  stronvsr  -sumptions  than  (i)~(vi),  he  dk)es  not 
require  that  (vii)  hold.  Thus  ms  results  are  not  implied  by  ourt  nor  ooo- 
veraely. 

'Scarf  also  assumed  that  the  coat  fucctkuw  and  demand  distribution*  do  not 
change  crer  Vn.e  although  that  ia  not  amentia!  to  hie  proof  (8,  p.  300). 
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Ai  an  example  to  illuminate  the  significance  of  (vii)  suppose  X  =  0 ; 
Ai  •  •  »  A.  are  identically  distributed;  and  c,  *  c  a,  =*  a,  g,(y,  t) 
-  9(y,/)»Vi(y,0  -  &  -  t,andi.  «  Ofort  =  1,2,  ••  •  ,n.Then(?,(|/)  -  G(?/) 
so  5*  *  5  for  i  **  1,2,  -,  n  —  1.  If  in  addition  c,+i  =  c,  then  GJy) 
<m  O(y)  also,  so  5*  »  $  and  (vii)  holds.  On  the  other  hand  if  r,+t  -  0 
(as  is  assumed,  for  example,  by  Scarf  [6]  end  others),  then  G„( y)  **  G(y) 
4  ac\y  —  E(D *)].  Thus  if  ac  >  0,  we  should  ordinarily  expect  §  >  £•  in 
which  case  (viii)  fails  to  hold  for  i  =  n  —  1.  Both  of  the  above  definitions 
of  c,+i  are  reasonable  formulations  of  a  ‘‘stationary”  version  of  our  model. 
It  is  of  some  interest  then  that  the  first  assumption  assures  that  (vii)  holds 
while  the  second  does  not.  Of  course  in  infinite  horizon  models  jlj,  }2j,  the 
difference  between  the  two  stationary  formulation-  vanishes.  Thus  our 
hypotheses  are  actually  weaker  than  Scarf’s  for  the  infinite  horizon  sta¬ 
tionary  models. 

Bounds  on  «.  and  5,  were  first  established  by  Iglehart  ir.  [ij,  (2|  under 
Scarfs  hypotheses,  the  assumption  that  the  cost  functions  and  demand 
distributions  do  not  change  over  time,  and  cr.  «  0.  Coder  the  abov- 
assumptions  except  c,+i  *■  c,  Veinott  and  Wagner  [12]  have  established 
bounds  of  the  form  (6).  In  their  analysis  (V,iy'  «  Giy),  so  the  bounds  in 
(6)  are  independent  of  t  even  though  this  is  not  true  of  *,  and  S, .  Our 
present  analysis  shows  that  the  bounds  remain  valid  under  the  weaker 
hypotheses  imposed  here. 

The  principal  tool  of  Scarfs  proof  is  the  fat*  that  if  J,(y)  is  A', -convex 
(see  [6]  for  a  definition),  then  so  is  /,( x\  This  mot  hex!  of  proof  fails  under 
our  hypotheses  because  J,(y)  need  not  be  K, -convex. 1  Our  proof  is  based 
instead  upon  the  following  two  lemmas  which  establish  properties  of  func¬ 
tions  satisfying  (4),  (5). 

Lemma  1. 

(8)  /.<*>  £  /.(V)  tfii,  x  £  /. 

Moreover,  if  (v)  holds,  then 

J,(y)  —  J.iy)  £  G,{y)  —  A* y)  -  a,K,ti, 

(9) 

Proof. 

1  rom  (4)  and  <  -V)  we  have  for  x  £  x  that 
/,(x)  £  A\  4  inf  J,(y)  £  A',  4  mfJ.iy)  £  A'.  4  f,ix), 

•  i<  *£»■ 

which  esUhltshre  (8). 

*  As  an  illustration.  if  A’,  *•  K  <  i.  t*.  »?•  -  (by)  ~  min  ;i, !  jr  ().*.  C  0. 

t)  —  g  —  I,  and  a,  -  «  £  1,  tb*n  (i*-fvii;  hold  However,  y.ijr;  -  Giy)  which 
is  not  Km- convex 


«  *  I,  ,  n, 

y  %  v, 

t  «  1,  •  ,  n. 
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For  y  2»  y ,  we  have  from  (v)  that  u,(y,  Dt)  £  v,(y Di).  Thus  from 
(4),  (5),  and  (8)  we  get 

Ji(y')  -  J,(y) 

-  G.(v')  ~  Gi(y)  +  aAU+My,  2>.))  -  fi+Mv,  A))} 
^  G.(y  )  —  (*.(y)  —  <*iKi+ 1 , 
which  completes  the  proof. 

The  proofs  of  (8)  and  (9)  are  purely  analytic.  An  alternative  pmof  of 
(8)  may  be  constructed  by  u.Mng  the  following  argument  which  may  be 
made  rigorous.  If  the  initial  inventory  on  hand  and  on  order  at  the  be¬ 
ginning  of  period  i  is  x  but  one  orders  in  each  period  )  ( j£i)  so  as  to  bring 
the  inventory  level  after  ordering  to  the  level  which  would  be  optimal 
if  the  initial  inventory  level  in  period  i  were  instead  x  ( six),  then  the 
associated  expected  discounted  cost  would  not  exceed  ft{x  )  +  Ki .  But 
since  the  policy  just  described  cannot  be  better  than  the  optimal  policy, 
(8)  must  hold.  A  similar  kind  of  argument  may  be  used  to  establish  (9). 

Lemma  2.  If  (v)  holds,  if  ja^j  is  a  .sequence  of  numbers  for  which  e,(a> ,  t) 
5  a  hi  for  l  £  if,  and  j  2*  i,  and  C  G,{y)  »>  nonincreating  m  y  on  ( —  ® ,  a,] 
for  j  &  i,  then 

(10)  J,(y)  -  J,(y)  £  Gi(y)  -  <7,(y)  £  0,  y  £  y'  £  a, , 
and 

(11)  /,<*')“/,(*)  S  0,  xSx 

forj  £  J 

Proof.  The  proof  is  by  induction  on  ).  Suppose  (10),  (11)  hold  for)  +  1 
(>*)•  By  (v),  v,(y,  D,)  £  v,(y,  D,)  £  v ,(o> ,  £,)  £  Oj+, .  Hence  using 
(11)  for  j  +  1  we  get 

JAv)  -  J*iy) 

m  G,(y)  -  G,(y)  4-  otE\J)AX{v,(y .  D,))  -  f,+i(v,(y,  Dj)) ] 

£  G, (y)  -  G,(y)  £  0, 

which  proves  (10)  for). 

It  follows  from  (10)  for  the  integer)  that 

/,( x)  -  min  (J/xi,  K,  +  inf  J,iy)\ 

*>* 

£  min  \JJt).  K,  +  inf  J,(y)|  -  f,(z), 

»>» 

which  proves  (11)  for  the  integer  ).  The  same  arguments  suffice  to  establish 
(10),  (11)  for)  —  n  which  starts  the  induction  and  completes  the  proof. 
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The  proofs  of  { 10)  and  (11)  are  purely  analytic.  An  alternative  proof  of 

(11)  may  be  devised  by  using  the  following  argument  which  can  be  made 
rigorous.  Suppose  the  initial  inventory  level  in  period  j  is  x  .  Suppose  also 
that  one  orders  so  as  to  bring  the  initial  inventory  level  after  ordering  in 
each  period  k  ( «£;)  as  close  as  possible  to  the  level  which  would  be  optimal 
if  the  initial  inventory  level  in  period  j  were  x  (^x).  This  policy  incurs 
expected  discounted  costs  which  are  no  greater  than  f,(z).  But  the  policy 
also  must  incur  expected  discounted  costs  at  least  as  large  as  f/yx).  Com¬ 
bining  these  remarks  proves  (11).  The  inequality  (TO)  can  be  justified  in 
a  similar  way. 

Theorem  1.  If  (i)-(vii)  hold,  there  ext&s  an  optimal  policy  which  is  an 
(«,  S )  policy.  Moreover ,  the  parameters  of  that  policy  satisfy  (6),  (7). 

Proof.  The  proof  is  constructive  and  proceeds  in  several  steps.  To  begin 
with  suppose  J,(y)  is  continuous  in  y. 

( a)  Ji(y)  is  nonincre&sing  on  ( —  » ;  £,]. 

To  see  this  recall  from  (iv)  that  Gj(y)  is  nonincreasing  in  y  on  (—<*>,  &,] 
for;  2;  i.  Thus  by  (vii)  and  Lemma  2,  (a)  holds. 

Since  J,(y)  is  continuous,  there  is  an  Si  which  minimizes  Ji(y)  on 
<§,].  Thus  Si  satisfies  (8).  Moreover, 

(b)  mi nvJi(y)  =  Ji(S,). 

To  see  this  observe  from  (a)  that  St  minimizes  J,(y)  on  (—»,  5,]. 
Also  by  Lemma  1,  (iv),  and  the  definition  of  5,  we  have  for  y  >  5,  that 

Ji(y)  -  MSi)  *  G,(y )  -  Gi(Si)  ~ 

Z  Gi(Si)  -  Gi(Si)  ~  «,#.> i  -  0. 

Thu3  (b)  holds. 

(c)  There  exists  a  number  s,  satisfying  (6),  (7)  and 

(12)  Ji(Si)  +  Ki  -  Ji(st)  -  0. 

In  order  to  prove  this  assertion  we  observe  from  Lemma  2,  (b),  and  the 
definitions  of  §,  and  s,  that 

Ji(Si)  +  Ki  —  £  J,(St)  +Ki~  Ji(si) 

(13) 

£  Gi(Si)  +  Ki  -  Gi(Si)  -  0. 

On  the  other  hand  bv  Lemma  1  and  the  definitions  of  §,  and  8,  we  have 

i  JfSi)  +  Ki  -  Ji(ii)  fe  Gi(Si)  -  (7,(8, )  +  Ki  -  a<Ki+l 

{'"J  Z  Gi(&i)  -  G\(S.)  +  Ki-  a.iKi+1  -  0. 

From  (13),  (14).  and  the  continuity  of  J,(y),  it  follows  that  there  is  an 
a,  satisfying  (6)  and  (12).  Moreover  (7)  holds  also  since  by  Lemma  1 
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and  (12)  we  have 

0  =  JiiSi)  +  K<  -  Jt( Sl)  £  Gi(Si)  -  (?,(«,)  +  Ki  -  aXm 
which  completes  the  proof  of  (<■). 

(d)  The  value  of  y  which  minimizes  the  right  side  of  (4)  is  determined  by 

Si  if  x  <  8 , , 

\x  if  x  ^  Si . 


y  = 


To  prove  (d),  observe  from  (a),  (b),  and  (c)  that  for  £<*,-, 


Ji(z)  ^  «/.■(*.)  =  J,(Si)  +  Ki  =  min„  Ji(y)  +  X,-, 

so  y  =  Si  minimizes  the  right  side  of  (4).  Now  for  s,  ^  i  ^  Si ,  the  same 
arguments  give 

Ji(z)  =  J,M  =  min uJi{y)  +  K, , 

so  y  =  x  minimizes  the  right  side  of  (4).  Finally  for  Si  <  x  <  y,  we  have 
from  Lemma  1,  (iv),  and  (vi)  that 


Ji(y)  +  K,  -  J,(x)  ^  Gi(y)  -  G,(x)  +  Ki  -  a,K,+l  £  0, 


so  y  -  x  minimizes  the  right  side  of  (4).  This  completes  the  proof  of  (d). 
It  remains  only  to  verify  our  assumption  that 
(e)  Ji(y)  is  continuous  in  y. 

We  prove  (e)  by  induction  on  i.  The  assertion  is  trivial  for  i  =  n  since 
Jn(y )  =  Gn(y).  Suppose  now  (e)  holds  for  the  integer  t  +  1.  Then  by  (d), 


(15) 


/*+»(*) 


J  Ki+i  +  J,+i(5i+i)  if  x  <  8i+ j , 
[Ji+i(x)  if  x  g  «i+1 . 


Since  J,-n(y)  is  continuous  and  (12)  holds  for  t+1,  fi+i(z)  is  evidently 
continuous.  Since  (?,(y)  is  continuous  b>  (i),  J,(y )  will  be  continuous  if 


Efi+i(v>(y,  />•))  =  q(y) 

is  continuous  in  y  on  any  arbitrary  interval,  fa,  b],  say.  Since  by  (i)  and  the 
continuity  of/,+i ,  the  composite  function  fm(v,(y,  i))  is  continuous  in  y, 
q(y)  will  be  continuous  on  [a,  b]  if  the  composite  function  is  uniformly 
bounded  for  y  f  fa,  6]  and  all  t  by  virtue  of  the  dominated  con¬ 

vergence  theorem.  We  now  show  that  there  is  a  number  B  such  that 

(16)  Ji+x(Si+l)  £  fi+i(vi(y,  t))  £  Ui(B)  +  Ki+i 


for  all  t  (^Vi)  and  y  6  [a,  6],  which  gives  the  desired  bounds.  The  left- 
hand  inequality  follows  from  (15)  and  (b).  Since  by  (v),  v,(y,  t) 
£  vx(b,  t)  B  for  some  B,  the  right-hand  inequality  follows  from  Lemma  1. 

The  proof  is  now  complete  since  we  have  constructed  a  solution  to  (4) 
with  the  infimum  being  attained  for  each  x. 


1076 


ARTHUR  F.  VEINO’FT,  JR. 

As  we  have  remarked  before,  if  Scarf’s  hypotheses  ( Gi(y )  convex, 
Gi(y)  —*  oc  as  j  y  |  — ►  «,  t>,(y,  t)  =  y  -  t,  and  (vi) )  are  substituted  for 
(i)-(vii),  then  there  exists  an  optimal  policy  which  is  an  (a,  5)  policy. 
However,  the  lower  bounds  in  (6)  for  a,  and  S,  are  no  longer  valid  when 
(vii)  fails  to  hold.  The  reason  for  this  is  clear  upon  reflection.  For  exam¬ 
ple,  suppose  jD„_i  ^  0  and  P(Z)n-i  <  §«-i  —  Sn)  >  0  so  (vii)  iocs  not 
hold.  In  this  event  it  is  apparent  from  (1)  that  one  would  not  want  to 
order  up  to  S,,_i  (or  more)  in  period  n  —  1  if  Gn(y)  increased  sufficiently 
rapidly  on  the  interval  [<S»  ,  S,_ij.  The  reason  for  this  is,  of  course,  that 
the  relatively  low  expected  costs  in  period  n  —  1  would  be  more  than  offset 

by  the  extremely  high  expected  costs  in  period  «.  For  a  concrete  illustra¬ 

tion  see  footnote  4  below. 

Let 

g.  /&  if  *  = 

~l  \min  (§i,§m  +  *)  if »  -  1,  2,  •  •  •  , n  -  1. 

Let  8i  ( ;£  min  (a, ,  S,) )  be  chosen  so  that 

Giia)  =  Gi(§i)  +  Ki . 

Theorem  2.  Under  Scarfs  hypotheses,  there  is  an  optimal  policy  { («, ,  S,)} 
which  satisfies  (7)  and 

(6;)  $,  ^  8,  £  Si  and  Si  :§  S,  £  Si,  t  «  1,  2,  •  •  •  ,  n. 

Proof.  We  only  sketch  the  proof,  leaving  the  details  to  the  reader.  The 
upper  bounds  on  s,  and  Si  and  the  inequality  (7)  are  established  by  ap¬ 
plying  Lemma  1  in  exactly  the  same  way  as  in  Theorem  1 .  The  lower  bounds 
on  Si  and  S,  may  be  established  by  applying  (10)  with  a,  —  S,  for  ally 
in  a  manner  similar  to  that  employed  in  proving  Theorem  1. 

We  remark  that  if  (vii)  holds  then  S,  =  §,  and  a,  =  s,  for  all  i  so  that 
(6*)  reduces  to  (6). 

3.  Planning  horizons  and  special  cases.  The  next  result  tells  us  that  if 
Si,  is  sufficiently  small  in  comparison  with  s*+1 ,  then  (a* ,  §k)  is  optimal 
for  period  k.  Observe  that  this  is  the  policy  that  is  optimal  for  period  k 
when  considered  by  itself  or  as  the  final  period  of  a  fc-period  model.  More¬ 
over,  if  S, ,  •  •  •  ,  Sk  are  sufficiently  small  in  comparison  with  8*+i ,  an 
optimal  policy  for  periods  i,  •  •  •  ,k  may  be  determined  without  evaluating 
/*4i(x)  for  any  x.  In  this  sense,  period  k  is  a  planning  horizon.  The  actual 
calculations  are  carried  out  using  (4)  recursively  where  /*+ i(x)  »  0  for 
all  x.  The  theorem  generalizes  some  results  in  [9]  to  the  case  where  there 
is  a  setup  cost  for  placing  orders. 

Theorem  3.  Suppose  (i)-(vii)  or  Scarfs  hypotheses  hold,  and  that 
{(*>  >  5,)}  is  an  optimal  policy. 
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(a)  //{ <iy)  is  a  collection  of  numbers  for  which 
(17  )4 

(18)  a*+ 1  ^  , 

(19)  fi(fli » 0  =»  o>+i  /or  t  and 

(20)  S,-  £ 

/or  j  —  A,  then  (s* ,  £*)  is  optimal  for  period  k. 

(b)  If  ( 18) — (20)  hold  for  j  =  i,  i  4-  1,  •  •  •  ,  k,  then  one  optimal  policy 

for  periods  i,  i  +  ,  k,  is  independent  of  /t+1(  • ). 

Proof.  We  begin  by  proving  part  (a).  From  Theorem  1,  Scarf’s  theorem, 
and  (18), 

(21)  /h-i(z)  =  Kt+i  -F  Jh-.(S*+1)  s  5,  x  ^  ot+x . 

It  follows  from  (v)  and  (19)  that  for  t  g  jj*  and  y  3a  at ,  t/*(y,  f) 
<»  t>*(a* ,  t)  £  o*+ 1 .  Combining  this  remark  and  (21)  we  get 

(22)  J*(y)  =  Gt(y)  -F  atEft+iivtiy,  Dk ))  —  Gt{y)  -F  a*Q,  y  ^  o* . 

Now  by  (20),  «/*(s/)  achieves  its  minimum  on  (—  »,  ®)  in  (— <®,  a*). 
Since  this  is  so  it  follows  from  (22)  and  (17)  that  &  minimizes  Jt(y) 
on  (-  oo,  oo ).  Moreover,  again  by  (22)  we  have 

Jt(8k)  -  <&(*)  +  «*<?  =  ft  +  <?*(&)  +  a*0  =  ft  +  /*(&)• 

Hence,  by  Theorem  1  and  Scarf’s  theorem,  (& ,  5*)  is  optimal  for  period 
fc,  which  establishes  part  (a). 

In  order  to  prove  part  (b)  we  observe  from  Theorem  1,  Scarf’s  theorem, 
and  (20)  that  the  optimal  policy  in  period/,  t  s£  /  £  fc,  can  be  determined 
provided  only  that  we  can  evaluate  J j{y)  for  y  g  aj .  Now  from  (v), 
(19),  and  (20)  it  follows  easily  by  induction  on  /  that  J/(y)  may  be  evalu¬ 
ated  for  y  Sa  o>  without  evaluating  fk+i(x)  for  x  >  oh-i  ,  i.e.,  for  y  £  aj, 
Jj(y)  depends  upon  /*+ 1  only  through  the  constant  Q.  (We  have  already 
shown  this  for /  =  k  which  starts  the  induction.) 

We  may  exhibit  the  dependence  of  J j(y)  upon  Q  by  writing  Jj°(y). 
It  U  easy  to  show  by  induction  on  /  that 

(23)  /A*)  -//(») +Qg«o  /«  M  +  l, 

4  The  hypothesis  (17)  is  easily  seen  to  be  satisfied  if  (18)-(20)  hold  for  j  «  k  and 
either  (1)  (i)-(vii)  hold  or  (2)  Scarf’s  assumptions  are  fulfilled  and  Kt+i  >  Oor St  <  a*. 
The  following  example  shows  that  (17)  cannot  be  dispensed  with  under  Scarf’s  hy¬ 
potheses.  Assume  n  -  *  +  1  -  2,  Oi(y)  -  1  y  -  2  j,  Qt(y)  »  2  |  y  —  1  |, Ki  *»  Kt  m  0, 
P{Di  m  0)  m  P(Dt  m  0)  “  1,  and  s>  *  v*  m  0.  Then  *i  ■  Si  ■  *i  •  Si  ■  1  and 
|i  -  “  2-  Now  let  at  •  at  ■  1.  Then  (18)-(20)  hold,  but  (ji  ,  Si)  »  not  optimal 

for  period  1. 
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(Again  we  have  already  done  this  for  ;  «  k  which  starts  the  induction. ) 
Thus  if  a  policy  is  optimal  for  period  i  £  j  £  k,  for  some  Q,  that  same 
policy  is  optimal  for  all  Q.  Hence  if  we  assume  Q  —  0  and  determine  the 
optimal  policy  for  periods  i,  i  +  1,  •  •  •  ,  k  in  the  usual  way  (taking  ac¬ 
count  of  (20)),  that  policy  is  optimal  for  the  original  problem  where  (21) 
holds.  We  have  thus  shown  that  the  optimal  policy  in  periods 
*,  i  +  1,  •  •  •  ,  k  is  independent  of  /*+i(  • )  as  required. 

As  an  illustration  of  the  application  of  Theorem  3(a),  we  have  the  fol¬ 
lowing  result. 

Corollary  1.  If  (i)-(vii)  or  Scarfs  hypotheses  hold,  and  if 

(24)  vk(Sk,t)  g  gi+i  for  t  £  ijt, 

then  (g k ,  §k)  is  optimal  for  period  k. 

Proof.  Let  ak  -  Sk  and  ak+l  =  g*+i .  Then  apply  Theorem  3(a). 

We  remark  that  if  (24)  holds,  then  by  (v)  and  the  definitions  of  §k , 
5* ,  (*+i »  §k+ 1  (recall  3*41  *  Sk+i  if  (i)-(  vii)  hold), 

Vk(Sk  ,  t)  ^  Vk(Sk  ,  t)  ^  8k+i  £  <5*41  , 

so  (vii)  holds  for  the  integer  k.  It  follows  therefore  that  if  (24)  holds  for 
all  k,  then  (vn)  necessarily  holds  also.  In  this  event  we  can  replace  s*+i 
in  (24)  by  6*4.1 . 

Example  1.  Suppose  unsatisfied  demand  is  backlogged  in  period  k  so 
that  vk(y,  t)  —  y  —  t.  Then  (24)  reduces  to  5*  —  **4.1  Ss  n* .  Thus  if  the 
minimal  demand  in  period  A:  is  at  least  Sk  —  §*41 ,  then  (5* ,  §k)  is  optimal 
for  period  k  by  Corollary  1.  A  special  case  of  this  result  is  established  in 
[12,  p.  545]  for  the  stationary  case. 

The  next  example  illustrates  the  application  of  Theorem  3(b). 

Example  2.  Suppose  unsatisfied  demands  are  backlogged  so  that 
0  “  V  —  t-  Also  let  0*41  =  sh-  1  and  let  |a,|  be  defined  recursively 
by  o>  —  iji  -  0/4i  for;  k.  Thus  a,  =  §m  4*  £>-,  for  i  £  k.  Now 
let  t  be  an  integer  for  which 

(25)  Sf  Si  5*41  4-  £  ,  ;  “  i,  i  4-  1,  •  •  *  ,  k. 

<-/ 

Then  the  hypotheses  of  Theorem  3(b)  are  evidently  satisfied.  In  particu¬ 
lar,  if  11,  »  0  for  i  2  t  £  k,  then  the  optimal  policy  in  periods 
*,  1  +  1,  •  •  •  ,  k  may  be  determined  without  evaluating /*4i(  • )  if  5/  £  sk+t 
for  t  £  ;  £  k. 

We  remark  that  if  (i)-(vii)  hold  and  if  A,  -  0  for  all  i,  then  we  choose 
,  l< ,  and  Si  equal  to  &  s o  «<  «■  5,  -  £,  for  all  i  by  Theorem  1.  In  this 
event  the  hypothesis  (vii)  is  equivalent  to  the  hypothesis  that  (24)  holds 
for  all  k.  This  observation  together  with  Corollary  1  establishes  the  next 
result  which  is  known  from  [10],  [11]. 
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Corollary  2.  If  (i)-(vii)  hold  and  if  Ki  «  0  for  all  i,  then  {(Si  *  &)} 
is  an  optimal  policy. 

4.  Applications  and  extensions.  In  this  section  we  discuss  some  applica¬ 
tions  and  extensions  of  our  results. 

Applications  of  the  basic  lemmas.  Lemmas  1  and  2  are  useful  in  establish¬ 
ing  bounds  on  the  ordering  regions  and  order  quantities  even  where  — <?,(p ) 
is  not  unimodal.  We  shall  illustrate  this  point  under  the  assumption  that 
Ki  —  0  for  all  t,  leaving  the  other  case  to  the  reader.  Suppose  Gt(y)  ap¬ 
pears  as  in  Fig.  2.  The  domain  of  Gi(y )  is  divided  into  six  regions  labeled 
1,  2,  • •  •  ,  6.  If  period  i  were  considered  by  itself,  and  if  the  initial  inven- 
toiy  in  period  i  fell  in  an  odd-numbered  region,  it  would  be  optimal  to 
order  to  the  upper  bound  of  that  region,  viz.,  to  17,  ,  I/* ,  or  t/5  as  appro¬ 
priate.  If  the  initial  inventory  in  period  i  fell  in  an  even-numbered  region, 
no  order  should  be  placed.  If  i  <  n,  then  the  above  policy  need  not  be 
optimal  for  the  n-period  model.  However,  suppose  y  lies  in  an  even-num¬ 
bered  region.  Then  by  Lemma  1, 

(26)  J<(y)  -  Ji(y)  £  Gt(y)  -  G,(y)  £  0 

for  all  y  £  y  provided  (v)  holds  so  it  is  optimal  not  to  order  in  period  i 
for  the  n-period  model.  Notice  also  that  the  above  inequality  tells  us  that 
if  the  initial  inventory  level  in  period  i  is  o  and  if  it  is  optimal  to  order, 
then  the  inventor'  L>»el  after  ordering  must  lie  in  the  interval  [b,  c). 

If  (v)  holds,  if  Vj(  Ui,t)  £  17,  for  t  £  ij, ,  and  if  Gj(y)  is  nonincreasing 
in  y  on  (  —  I/,]  for,;  Jg  i,  then  by  Lemma  2,  Ji(y)  is  minimized  on 

(—  »,  Ui\  at  y  =  Ux .  Similarly  from  (26),  Ji(y)  is  minimized  on  [(/, ,  » ) 
at  y  =  17, .  Combining  these  remarks  we  see  that  Ji(y)  is  minimized  on 
( —  oe,  oo )  at  y  »  Ui .  Hence,  in  region  1  it  is  optimal  to  order  up  to  17, . 

Variation  of  the  bounds  over  time.  It  is  of  interest  to  determine  how 
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and  Si  vary  over  time  in  relation  to  the  variation  of  the  cost  functions  and 
demand  distributions.  Although  this  appears  to  be  quite  difficult,  we  can 
instead  examine  how  the  bounds  on  s,  and  <S,  vary  over  time.  Such  studies 
are  of  interest  in  their  own  right  and  because  they  provide  us  with  a  tool 
for  determining  conditions  under  which  the  hypotheses  (vii),  (17)-(20), 
(24),  (25)  of  our  several  results  hold.  Throughout  this  subsection  we 
shall  assume  for  simplicity  that  sufficient  regularity  conditions  are  im¬ 
posed  to  permit  differentiation  and  interchange  of  differentiation  with 
integration  where  required. 

As  a  preliminary  we  record  several  lemmas  from  [11].  Let  I  be  a  subset 
of  the  integers  1,  •  •  •  ,  n. 

Lemma  3.  If  dW ,(y,  t)/dy  is  noninersasing  in  t  ^  ij,  and  i  £  I  for  each 
y,  and  if  <l>,(0  ^  <Pj(t)  for  all  t  and  i,  j  £  I,  i  <  j,  then 

(27)  Gi'(y)  £  Gt’(y),  i,j  €  /,  i  <  j,  and  ally. 

Lemma  4.  If  Wi(y,  t )  =  Wf(y  —  t)  +  W,*(t)  for  some  functions  W,\ 
W*,  if  dWi(z)/dz  is  nonincreasing  in  i  6  /  and  nondecreasing  in  z,  and 
if  $,(<)  <£  $;(<  —  bn)  for  all  t  and  i,  j  6  /,  i  <  j,  and  some  numbers  6,y , 
then 

(28)  Gi’(y)  £  G/(y  -  &„•),  i,j  6  /,  i  <  j ,  and  all  y. 

Lemma  5.  7/X  *  0,  if  Wt(y,  t )  =  W\y  —  f)  +  W,*(<)  for  some  func¬ 
tions  Wl,  W,*,  and  if  4»,(<)  =  4>(<  —  in)  for  all  t  and  i  £  I  for  some  distribu¬ 
tion  4>,  then 

(29)  Gi( y)  =  G(y  -  i »,)  +  Q> ,  i  €  /,  and  all  y, 
where  Qi  is  a  constant  and  G{y)  *  /  W{y  —  t)  d#(t). 

J—sc 

In  the  remainder  of  this  subsection  we  assume  for  simplicity  that  Ki  =*  K 
and  o<  *  a  ( £  1 )  for  all  t,  and  X  *  0.  Our  methods  can  be  applied  with¬ 
out  these  hypotheses,  but  not  without  expanding  the  exposition.  See  in 
particular  [11]  for  conditions  under  which  the  hypotheses  of  Lemmas  3 
and  4  are  satisfied  when  X  >  0.  There  is  also  an  analog  of  Lemma  5  when 
X  >  0. 

Let  8,'  ■  (§< ,  1, ,  §i  i  5,),  Bij  ~  (btj  t  b,j ,  6,/ ,  6,;),  and  Hi  *  (*?< ,  *ji , 
iji  i  n<).  Let  8  -  (},  J,  S,  5),  where  j,  I,  §,  S  are  defined  for  the  function 
G(  ■ )  (see  Lemma  5)  in  the  usual  way.  For  definiteness  where  8<  is  not 
uniquely  defined  we  choose  it  as  follows.  First  pick  the  smallest  possible 
Si .  Then  pick  the  smallest  j, ,  I, ,  and  5, .  Do  the  same  for  8.  The  follow¬ 
ing  theorem,  which  is  an  easy  consequence  of  Lemmas  3-5,  describes  how 
8,  varies  over  time  in  relation  to  the  variation  of  Wi  and  ♦,  (as  reflected 
in  Gt)  over  time. 
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Theorem  4.  Suppose  (i)-(iv)  hold. 

(a)  If  (27)  holds,  then  S,  g  Sy  for  i,j  £  I,  i  <  j. 

(b)  If  (28)  holds,  then  —  B,j  £  Sy  for  i,j  £  I,  i  <  j. 

(c)  If  (29)  te/ds,  (ten  St  —  H,  =  S /or  i  €  J. 

Satisfying  the  hypotheses  of  the  main  results.  In  this  subsection  we  use 

Theorem  4  to  give  conditions  under  which  the  important  hypotheses 
(vii)  and  (24)  of  Theorem  1  and  Corollary  1  respectively  are  satisfied. 
We  begin  by  giving  conditions  under  which  (vii)  holds. 

It  will  be  convenient  in  what  follows  to  assume  that  there  is  an  extended 
real  number  9  such  that 

(30)  v,(y,  t )  ^  max  (6,y  —  ij,),  for  t  i), ,  all  t,  and  all  y. 

As  an  example,  if  ij,  2:  0  and  unsatisfied  demands  are  backlogged  so 
Vi(y,  t)  —  y  —  t,  then  (30)  holds  with  9  2;  —  w.  Alternatively  if  ij,  ^  0 
and  if  unsatisfied  demands  are  lost,  so  Vi(y,  t)  =  max  ( y  —  t,  0),  then  (30) 
holds  with  9  ^  0.  In  applications  9  should  be  chosen  as  small  as  possible. 
Thus  6  -  —  *  in  the  backlog  case  and  9  —  0  in  the  lost  sales  case. 

The  following  result  is  a  simple  consequence  of  Theorem  4. 

Corollary  3.  Suppose  (i)-(iv)  and  (30)  hold,  I  =  [1,  2,  •  •  •  ,  n},  and 
S,  sE  9  for  all  i  >  1. 

(a)  If  (27)  holds  and  ij,  £  0  for  all  i  <  n,  then  (vii)  holds. 

(b)  If  (28)  holds  and  y,  —  6,.,+1  £  0  for  all  i  <  n,  then  (vii)  holds. 

(c)  If  (29)  holds  and  ij,  ^  0  for  all  i  $  n,  then  (vii)  holds. 

The  next  corollary  gives  a  condition  ensuring  that  the  hypothesis  (24) 
of  Corollary  1  holds. 

Corollary  4.  If  (i)-(iv),  (vii),  (30)  hold,  if  (28)  holds  vnih 
I  -  \k,  k  +  IJ,  if  §k+i  £  9,  and  if 

(31)  5*  —  5*  £  s*  —  bk.h+i » 

then  (24)  holds. 

Proof.  «i*(S*,  t)  5  max  (9,  St  -  m)  £  max  (9,  s *  - 
£  max  (fl,  g*+i)  -  s*+l  -  9*+t . 

Stationary  infinite  horizon  models.  This  paper  is  primarily  concerned 
with  a  finite  horison  model.  If  the  model  is  stationary,  i.e.,  Ot ,  K%,  at,  ♦, 
are  independent  of  t,  then  it  is  convenient  to  consider  an  infinite  period 
version  of  the  model.  In  this  case  fairly  obvious  modifications  of  Iglehart’s 
results  and  proofs  [11,  [2]  (see  also  [12,  pp.  530—531])  for  0  £  a  £  1  show 
that  if  (i)-(vii)  hold,  there  is  an  optimal  («,  S)  policy  with  the  optima! 
choioe  of  parameters  being  independent  of  time  and  satisfying  (6),  (7). 
Methods  for  computing  these  parameters  are  discussed  in  [8]  and  [12]. 

Restrictions  on  inventory  levels.  In  some  applications  it  may  be  desirable 
to  limit  the  choice  of  the  inventory  y,  on  hand  and  on  order  after  ordering 
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in  period  i(  -  1,  2,  •  ■  •  ,  n)  to  an  interval  [y, ,  y,]  say.  The  upper  bound 
y,  might  reflect  limitations  on  storage  space  while  the  lower  bound  y,  could 
reflect  a  desire  to  limit  the  size  of  the  backlogged  demand.  As  a  specific 
illustration,  suppose  demands  occur  over  only  the  first  n  —  1  periods,  so 
Di  *  0, »  £  n.  Then  we  may  wish  to  requiie  that  no  unsatisfied  demand 
exist  at  the  end  of  period  n  +  X.1  This  may  be  accomplished  by  setting 
Vn  m  0  so  y„  &  0.  This  implies  yn+\  £  0  if  v,(y,  0)  £  0  for  y  £  0  and 
n  £  i. 

In  other  applications  it  is  natural  to  suppose  that  the  demands  are 
integers.  Of  course  this  restriction  is  already  allowed  in  our  formulation. 
However,  in  such  cases  it  is  usually  necessary  to  impose  the  additional 
restriction  that  the  order  quantities  and  stock  levels  be  integers.  We  shall 
now  generalize  our  original  model  and  results  to  provide  for  such  integer 
restrictions  and  for  bounds  on  the  stock  levels. 

Let  Yi  denote  the  nonempty  set  of  admissible  stock  levels  y,  on  hand 
and  on  order  after  ordering  in  period  i.  Let  y,  -  inf  F,  and  y,  =  sup  F, . 
Let  D,  denote  the  (Borel)  set  of  possible  values  of  the  demand  D ,  in  period 
».  Let  Xi+i  denote  the  nonempty  set  of  possible  values  of  the  stock  on  hand 
and  on  order  before  ordering  in  period  i  +  1.  We  naturally  impose  the 
consistency  condition  that  vt(y,  t)  £  X<+i  for  all  y  £  F,  and  i  €  2D, .  In 
addition  we  suppose  that  if  the  stock  on  hand  and  on  order  before  ordering 
in  period  i  is  at  least  y,  in  period  i,  then  it  is  possible  not  to  order  in  period 
i.  Formally,  we  assume  that  r  £  inland  y,  £  x  imply  x  £  F, ,  i  =  1, 2,  •  • ,  n, 
where  X\  ■  {xi}.  We  also  suppose  that  the  domains  of  /,( • ),  G,(  • ),  J,{  • ), 
and  »,•(■,•)  are  respectively  X, ,  F, ,  F, ,  and  F,  X  2D, .  Moreover,  we 
shall  replace  (i)-(iii)  respectively  by: 

(i*)  (i)  holds  and  F,  is  closed; 

(ii')  either  (ii)  holds  or  <  *  ; 

(in')  either  (iii)  holds  or  —  *  <  y, . 

If  (i')  holds,  we  may  define 

G,+(y)  *  6,  (inf  \t  |  z  >  y,  z  £  F,|) 

*or  y  <  9x  and  (7,+(y,)  -  ®  if  y,  <  Also 

G,~(y )  -  <7,(sup(*|*  <  y,  i  £  F.j) 

for  y,  <  y  and  (?,"(y.)  “  ®  if  -  ®  <  y, .  For  example,  if  F,  -  (—«,«), 
then  Gi~(y)  -  G,(y)  »  G+(y),  while  if  F,  is  the  set  of  integers,  then 
G,(y  -  1)  and  G+(y)  -  G,(y  +  1)  for  y  £  F, . 

If  (i')-*(iii#)  hold,  there  are  a  number  £  F,  that  minimises  (?<(•)  on 
F,  and  numbers  $,  ( £§,)  and  S,  (  £  §,)  such  that 

G,(S.)  S  G,(&)  +  a,K1+l  £  (7,+(S.) 


*  I  &m  indebted  to  CJ.  Lieberman  for  a  discussion  of  this  application. 
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and 


CrM  &  (?<(§.)  +KiS  ariti). 

If  in  addition  (vi)  holds,  there  is  a  number  ,  j,  JS  I,  £  ,  such  that 

Gi(ii)  ^  Gi(§ ,)  +  K,  —  aJKi+1  ^  Gi  (I,). 

It  is  easy  to  check  that  the  statements  and  proofs  of  Lemmas  1  and  2 
remain  valid  in  our  new  setup.  Also  Theorem  1  holds  provided  we  replace 
(iy-(iii)  by  (i')-(iii')  and  replace  (7)  by 

(7')  arM  £  Gt(Si)  +  (Kt  -  ajci+l). 

Only  obvious  modifications  of  the  proof  of  Theorem  1  are  required. 
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1.  Introduction  and  Summary 

For  nany  probleas  in  applied  probability  it  is  difficult  to  obtain  explicit 
expressions  for  the  distributions  of  random  quantities  of  interest.  In  some 
probleas  however,  approximations  to  these  distributions  can  be  obtained  froa 
limit  theoreas  as  a  particular  physical  paraaeter  approaches  a  Hail.  These 
approximations  are  siailar  in  spirit  to  the  normal  approximation,  for  suae-  of 
independent  random  variables,  resulting  from  the  central  limit  theorem.  Our  pur¬ 
pose  in  this  expository  p&pcr  is  to  sketch  the  general  approach  to  these  limit 
theoreas  and  approxiaations  and  to  mention  a  number  of  techniques  which  have 
proved  to  be  useful  in  various  probabilistic  models. 

Tho  starting  point  for  us  is  a  given  sequence  of  stochastic  processes 
(Xn(k):  k  «  6.1,...}  for  n  «  1,2,...  with  each  process  defined  on  its  own 
probability  space  (°n'W^n'Pn)*  h./e  indicated  a  discrete  time  parameter  for 
these  processes,  however,  this  is  not  crucial  and,  m  fact,  we  shfcll  later  con¬ 
sider  processes  with  a  continuous  time  parameter.  The  state  space  of  our 
processes  can  be  either  discrete  or  continuous  and  either  real-valued  or  vector- 
valued.  The  index  n  for  the  sequence  is  meant  to  correspond  to  the  physical 
paraaeter  mentioned  above  In  some  models,  however,  the  sequence  index  will  not 
be  associated  with  the  positive  integers.  With  chi'-  set-up  given,  our  aim  is  to 
scale  the  time  parameter  and  to  translate  and  scale  the  space  variable  in  such  a 

1  This  work  was  supported  by  the  Office  of  »*.*w*i  Research,  Contract  Konr-401  (SS) . 

2.  This  paper  was  prepared  for  the  American  Mathematical  Society  1967  Summer 
Seminar  on  MithoMtics  of  (Ho  Decision  Scitncd  to  Ho  Hold  st  Stsoford  Univer¬ 
sity  from  July  10  to  August  11,  196? 
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V  * 


manner  that  the  resulting  processes  converge  to  a  limit  process  as  n  goes  to 


infinity.  In  general  we  shall  seek  sequences  {a  },  {b  },  and  {c  }  to  form  the 

n  n  n 

3 

processes 


The  an's  and  cn's  will  be  positive  real  numbers,  generally  tending  to  infinity 

as  n  *  •».  However,  the  b  '  s  will  be  vectors  if  the  X  's  are.  Notice  that 

n  n 

the  Y  (•)  processes  will  be  constant  for  stretches  of  length  a^  because  of 
the  discrete  nature  of  the  Xn's  and  hence  have  discontinuous  paths.  There  is 
an  alternative  way  to  define  the  Y  (•)  processes  which  leads  to  continuous 
path  functions.  This  latter  approach  has  certain  advantages  and  will  be  intro¬ 
duced  later. 

There  are  a  number  of  modes  of  convergence  for  the  Yr ( * )  processes  which 
are  of  interest.  The  simplest  of  these  is  to  require  convergence  in  distribution 
of  the  value  of  the  Y^( • )  processes  at  a  fixed  value  of  t;  i.e.. 


lim  pn(Yn(t)  <,  x}  *  P{Y (t)  <_  x}  for  all  t  >,  0 

n-*» 


and  all  x  for  which  the  right-hand  side  is  continuous,  where  Y(.)  is  a  limit 
process  defined  on  a  probability  space  (fl,5T,P).  Often  the  limit  process, 

Y(*)i  is  a  diffusion;  i.e.,  a  strong  Markov  process  with  continuous  path 
functions.  The  classical  central  limit  theorem  is  of  this  type,  where  Y( * )  is 
Brownian  motion.  Next  we  might  be  interested  in  showing  the  convergence  of  the 
finite-dimensional  distributions  (f.d.d.)  of  the  Y  (•)  processes  to  the  cor¬ 
responding  distributions  of  the  Y(‘)  process.  In  other  words,  for  every  k  >  1 

3.  The  symbol  [x]  denotes  the  integer  part  of  x. 


f 


* 


and  0  £  tj  <  <  . . .  <  we  would  like  to  show  that  the 
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lim  P^VV  «  V  ^'VV  -  xk}  -  P{Y(tj)  <  , . . .  ,Y(tk)  <  xk} 

n-H» 


for  all  Xj  for  which  the  right-hand  side  is  continuous. 

A  third  mode  of  convergence  which  is  important  for  applied  problems  is  weak 
convergence  of  a  sequence  of  probability  measures  defined  as  follows.  Let  S 
be  a  metric  space  and  be  the  smallest  Borel  field  containing  the  open  sets  of 


S.  If  v 


and  v  are  probability  measures  on  \Ji  and  if  the 


lim  /  f  dv  *  f  f  dv 
n-*»  S  n  S 

for  every  bounded,  continuous,  real-valued  function  f  on  S,  then  we  say  vn 
converges  weakly  to  v  and  write  vn  =>  v. 

For  our  problems  the  measures  vn  will  be  generated  by  the  Y  (•)  processes 
and  the  measure  v  by  Y(°).  In  most  cases  the  metric  space  S  will  be  C[0,1], 
the  space  of  continuous  functions  on  [0,1]  with  the  metric  of  uniform  conver¬ 
gence,  or  a  multi -dimensional  analog.  Weak  convergence  is  important  because  it 
implies  convergence  in  distribution  of  certain  functionals  of  the  original  pro¬ 
cesses.  Sometimes  in  applications  the  distribution  of  the  functional  is  more 
important  than  that  of  the  original  process. 

Once  these  limit  theorems  have  been  obtained  we  can  consider  using  the 
limit  distributions  as  an  approximation  to  the  distribution  of  Xn(*)  when  the 
parameter  n  is  sufficiently  "large,"  Of  course,  the  question  of  how  large  n 
must  be  before  a  "good"  approximation  is  ->btained  introduces  the  question  of 
rates  of  convergence  for  the  limit  theorems,  little  work  of  an  analytical  nature 
has  been  done  on  the  rate  of  convergence  issue,  however,  for  some  models  numeri¬ 
cal  results  are  available. 


The  protype  of  these  Unit  theorems  and  their  resulting  approximations  is 
the  convergence  of  sums  of  independent,  identically  distributed  (i.i.d.) 
random  variables  to  Brownian  motion.  These  results  will  be  sketched  in  Section 
2.  Models  from  queueing  theory  will  be  taken  up  in  Sections  3,  4,  and  5. 
Section  6  will  be  concerned  with  the  multi-um  Ehrenfest  model.  A  quality 
control  model  will  be  discussed  in  Section  7.  Finally,  in  Section  8  brief 
mention  will  be  made  of  other  work  on  the  convergence  of  processes  in  applied 
probability. 

2.  Sums  of  Random  Variables  and  Brownian  Motion 

Let  Xj .Xj, . . o  be  a  sequence  of  i.i.d.  random  variables  defined  on 
the  product  probability  space  (Q,«&P)  and  having  mean  0  and  variance  1, 

We  shall  denote  the  partial  sums  by  S^  *  X^.o.+X^  and  set  SQ  «  0.  The 
appropriate  sequence  of  processes  to  consider  is  defined  by 

Yn(t)  »  ,  0  <  t  <  1,  n.  1,2....  . 

✓n 

The  fact  that  we  have  restricted  the  time  parameter  t  to  the  unit  interval 
is  not  important,  but  does  make  the  exposition  easier.  Observe  that  for  a 
fixed  value  of  t  the  number  of  jumps  of  the  Yn(°)  process  is  of  order  n, 
the  mean  value  of  each  jump  is  0,  and  the  variance  of  each  jump  is  n'*. 

Thus  the  jumps  are  occurring  very  often  and  the:..*  heights  are  becoming 
small  as  n  grows  large  Hence  it  is  natural  to  hope  that  as  n  ♦  •  the 
paths  of  Y  ( *)  will  become  continuous. 


4a 


Alternatively,  we  could  consider  a  sequence  of  processes  defined  by 

-1  -1 

Z  ft)  ■  —  ♦  (nt-k) - ,  kn  1  <  t  <  (k+l)n  1 

n  G  n  “ 

for  k  ■  0,1,..., n-1.  Notice  that  Zn (t)  *  Yn(t)  for  t  of  the  fora 

kn"1,  k  «  0,...,n,  and  is  defined  by  linear  interpolation  for  other  values 

of  t.  Hence  the  paths  of  Zn(*)  are  automatically  continuous  and  this  is 

convenient  when  considering  weak  convergence  of  probability  measures. 

We  shall  be  interested  in  showing  that  the  sequences  (Yn(*))  and 

{Zn(.)}  converge,  as  discussed  in  Section  1,  to  Brownian  motion,  Y(*)»  on 

the  unit  interval.  As  a  check  to  see  that  we  are  on  the  right  track,  we 

calculate  the  infinitesimal  mean  and  variance  of  Y  (t)  per  unit  tiae. 

n 

2 

Denoting  these  means  and  variances  by  «n(y)  and  an(y) ,  we  have 
mn(y)  -  n  E(Yn(t*i)  -  YJt)  |Yn(t)  -  y> 

«  n  E  -  0 

Sn 


9 


5 


and 


?(y)  ■  "  E(['n(t»i)  -  Vn(t))2|Yn(t)  .  y) 


E{W1> 


This  infinitesimal  mean  and  variance  agrees  with  those  of  Brownian  motion  which 
provides  a  useful  check  before  nroceeding  to  a  rigorous  analysis. 

The  proof  of  the  convergence  of  the  one-dimension  distributions  is  simply 
the  central  limit  theorem;  i.e., 

lim  P{Y  (t)  <  x}  «  (2irt)-1^2  /  exp{-  -|r}  ds 

n-*» 


The  normal  distribution  on  the  right-hand  side  is  also  the  distribution  of  the  posi¬ 
tion  of  firownian  motion  at  time  t.  To  show  convergence  of  the  f.d.d.  the  Levy 
continuity  theorem  for  characteristic  functions  can  be  used.  The  case  k  *  2 
embodies  the  general  argument  which  goes  as  follows.  Let  ♦n(si»s2;ti*t2^  be 
the  joint  characteristic  function  of  Yn(tj)  and  (t j) •  Then 


VWW  *  ElexP{ 


i(s1*s2) 


is. 


Letting  n  •  we  see  that 

(s.*s  )2t  s2(t  -t  ) 

VWW  *  exp{ - 2 - - 2 — ~  }  • 


which  is  the  joint  characteristic  function  of  Brcwnian  motion  at  times  tj  and 
t2<  Similar  arguments  can  be  used  to  show  the  convergence  of  the  f.d.d.  of  the 
Zn()  processes. 
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To  establish  weak  convergence  of  the  measures  associated  with  (Yn(*)}  or 
{ f * ) >  two  steps  are  required.  The  first  step  is  the  convergence  of  the  f.d.d. 
which  we  have  demonstrated  above.  Secondly,  the  probability  that  the  approximat¬ 
ing  processes  can  have  large  fluctuation  between  points  at  which  they  are  deter¬ 
mined  by  their  f.d.d.  must  be  shown  to  be  small.  The  notation  of  weak  conver¬ 
gence  is  intimately  related  to  the  so-called  invariance  principles.  An 
invariance  principle  for  sums  of  i.i.d.  random  variables  states  roughly  that 
the  limit  of  the  distribution  of  various  functionals  of  the  S^'s  is  independent 
of  the  common  distribution  of  the  XJs,  provided  the  mean  is  0  and  variance 
1.  Such  an  invariance  principle  was  first  given  by  Erdos  and  Kac  (1946)  and 
later  generalized  by  Oonsker  (1951)  and  Billingsley  (1956).  To  carry  out  the 
second  step  mentioned  above  for  {yo>  (respectively,  { Zn (•>)))  the  reader 
should  consult  Donsker  (1951)  Billingsley  (1956)).  Other  important  references  for 
weak  convergence  of  sums  of  i.i.d.  random  variables  are  Prokhorov  (1956), 
Skorokhod  (1956),  and  Ito-McKean  (1965). 

It  is  important  to  note  that  while  the  approximating  processes  could  all  be 
defined  on  a  single  probability  space  the  limiting  Brownian  motion  was  defined  on 
another  probability  space.  Thus  with  this  set-up  it  makes  no  sense  to  speak  of 
convergence  with  probability  one  or  in  quadratic  mean,  for  example.  However, 
when  the  X^'s  are  Bernoulli  random  variabiestKnight  (1962)  has  succeeded  in 
defining  the  approximating  processes  and  the  limit  process  on  a  single  probability 
space  and  then  shown  probability  one  convergence.. 

3.  Waiting  Tine  fox  the  Queue  GI/G/1 

In  this  section  we  shall  consider  the  u. smbution  of  the  waiting  time  in 
the  single-server  queue  with  general  independent  input  and  general  service  tine. 
Our  objective  is  to  demonstrate  the  usefulness  of  the  notion  of  weak  convergence 
in  studying  the  asymptotic  behavior  of  the  waiting  tine  as  the  traffic  intensity 
o,  goes  to  1.  This  discussion  is  based  on  work  of  Prokhorov  (1956,  1963). 
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Let  Wn  be  the  waiting  tine  (tine  before  service  begins)  for  the  n**1 

customer.  We  shall  denote  the  service  time  of  the  n  customer  by  v  and 

n 

the  interval  between  the  arrival  times  of  the  nth  and  (n*l)st  customers  by 

u  .  Then  if  W,  ■  0, 
n  l 

(1)  Pn+1(*)  -  pT{Wn+iix)  *  p^s!  <  x.  S2  <  x,  ....  Sn  <  x}, 

i 

where  X  ■  v  -  u  and  S.  ■  T  X..  This  result  was  first  derived  by  Lindley 
n  n  n  i  J 

(1952);  see  Prabhu  (1965)  for  a  comprehensive  discussion  of  the  GI/G/1  queue. 

The  usual  independence  assumptions  regarding  (vn)  and  {u^)  imply  that  the 
X^'s  are  independent.  Since  the  distribution  of  is  seen  to  be  equal  to 

the  distribution  of  the  maximum  functional  on  the  process  of  partial  sums,  , 
it  is  natural  to  hope  for  limit  theorems  which  lead  to  the  maximum  functional 
on  Brownian  motion. 

Consider  now  a  sequence  of  such  queueing  processes  indexed  by  6,  in  which 

(«4)  («*) 

E{u^  }  ■  l/(l-6^),  E{vn  )  »  1,  and  hence  ■  1-6^.  Thus 

(«J 

E(Xn  )  •  -6^  /(l-«i).  We  shall  treat  the  case  >  0  (or  pa  <  1)  here, 

although  for  many  of  the  arguments  the  sign  of  6^  is  not  important.  Our  goal 

(«.) 

now  is  to  obtain  limit  theorems  for  W  as  6 .  Ns  0  and  n,  ♦  •.  There  are 

n^  l  x 

2 

a  number  of  possible  limit  theorems  depending  on  whether  converges  to  zero, 

a  positive  constant,  or  plus  infinity. 

Theorem  5.1  of  Prokhorov  (1956)  is  the  principal  tool  used  in  obtaining 
these  limits  theorems.  The  set-up  for  the  theorem  is  as  follows.  A  double 
sequence 


Xn,l*  Xn,2 . Xn,kn 

of  random  variables  is  given  which  are  independent  for  each  n  and  subject  to 
the  asymptotic  negligibility  condition 


for  all  c  >  0. 


lia  max  Pr{ | X  fc|  >  e)  -  0 
n—  l<k<kn  ’ 

k 

n 

The  first  two  eoaents  satisfy  E{X„  .  }  ■  0,  o2  .  ■  o2[X  .  }  >  0,  and  T  o2  .  ■  1. 

n,K  n,K  n#k  n#ic 

k 

Let  the  partial  suas  be  denoted  by  Sn  ^  ■  0  and  Sn  ^  ^  for 

2  ^ 

1  i  k  i  kn*  *e  let  tn  j(  ■  0  [sn  jj]  construct  the  continuous  path  function 
Y  (t)  which  is  piece-wise  linear  with  vertices  at  the  points  (t  ,  ,  S  v).  The 
aeasure  induced  by  Yn(t)  on  C[0,1]  we  shall  denote  by  Pfl  and  that  induced 
by  Brownian  notion  by  P.  Then  the  theorea  is  as  follows. 

A  necessary  and  sufficient- condition  for  Pn  to  convene  weakly  to  P  is  that 
k” 

(2)  li«  I  /  x^F,  v(x)  -  0 

n-k-l  |x|>»  "-k 

for  all  X  >  0,  where  F_  L(x)  ■  Pr{X_  u  <.  x}. 

* n  I  K  n  I  K  * * 

2 

First  consider  the  case  where  n^  ♦  0  as  i  •.  We  shall  assuae  that 

2  (V 

o  [X  ]  ■  0^  >  0,  o.  -*  o,  and  that 

,  2  (V 

(3)  /  x*dK  1  (x)  -  0 

Mi* 

(«,)  («x) 

uniforaly  in  6.  as  z  ♦  •,  where  K  is  the  distribution  inaction  of  5 

1  R 

The  latter  condition  assores  us  that  condition  (2)  is  satisfied  for  our  queueing 
problea.  Froa  (1)  we  have  for  0  <  i  <  • 

(6i> 

(V  r-  «V  Sk 

(<}  •  Pr<*n  <  xoj  •  Pr{ — *  x,  1  <Ji  <  Rj)  . 

i  i 

One  can  easily  check  to  see  that  the  conditions  of  the  abort  theorea  hold 


and  hence 


(5) 
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Pr 


-  («i)  («i)  -i 

f  S.  1  k  E{X.  }  1 

<  — - <x  ♦ - - -  ,  l<k<n> 

l°i^7  J 


converges  as  i  •  to  the  naxinun  functional  of  Brownian  notion,  namely 


P{  nax  Y (t)  <  x}  , 
0<t<l 


where  Y(t)  is  the  path  function  of  Brownian  notion.  This  probability  is  known 
to  be  2t(x)  -  1,  the  truncated  nornal  distribution.  Since  we  have  assumed  that 


n^6*  0,  the  terns 


(6) 


(«,) 

k  E{X.  1  } 
°i 


k  6 . 


°i'^T 


>  0  as  i 


for  k  »  l,2,.,.,n^.  Hence  putting  together  (S)  and  (6)  yields 


<«,) 


\i,  <*  °iS>  '  2*(,)  • 1 


This  result  is  of  course  also  true  in  the  case  where  6^  ■  0. 

2 

Next  we  assume  ♦  t,  0  <  t  <  «».  Using  (1)  again  we  have 

UI*1,I*,|  1  •  Hr  •  "i  I 

V,W,il  •  o.^  i  o .JTt. 


(4<) 


v  i^  1 

By  assumption  -njE[Xj  1 6^  ♦  t  and  oi  •'rTT  6  ♦  o  fx  .  Using  the  weal  conver¬ 
gence  again  we  have 


(7) 


(4.) 


ix  IW)  •—)<  —  ) 

V1  0<t<l  o/T  o/T  * 


By  a  change  in  tine  scale  the  last  probability  can  also  be  written  as 


i 
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(8)  P^aax  [Y (t)  -  |]  <  i  }  . 

2 

Finally,  consider  the  case  where  n^6^  -»  ».  We  would  expect  the  limit  of 

C«i> 

F  ,  (x/ 5 , )  to  be  the  expression  in  (8)  with  t  replaced  by  •.  This  is  in 

*  i  * 

fact  true,  although  an  additional  argument  employing  Kolmogorov's  inequality  is 
required.  For  a  fixed  6  >  0,  the  stationary  distribution  of  the  Markov  chain 
exists  and  is  given  by 

F(6)(x)  -  Frtsup  s£6)<  x} 

k>l  *" 

cm 

In  the  course  of  deriving  (8)  with  r  replaced  by  •,  Prokhorov  also  shows  that 

(6 . )  2 

(9)  F  1  (x/3.)  -  P{  max  (Y(t)  -  xo*1}  ■  1  -  e"2x/°  . 

0<t<- 

The  last  equality  was  derived  by  Darling  and  Siegert  (1953).  The  result  given 
in  (9)  was  first  derived  by  Kingman  (1%2).  For  an  expository  account  of 
Kingman's  work  in  this  area  of  so-called  "heavy  traffic"  the  reader  should  con¬ 
sult  Kingman  (1965). 

From  this  work  of  Prokhorov's  it  should  be  clear  that  the  notion  of  weak 
convergence  is  an  important  one  for  applied  probability.  For  other  work  in  this 
spirit  consult  Viskov  (1964),  Viskov  and  Prokhorov  (1964),  and  Porovkov  (1964). 

4.  The  Many  Server  Queue  and  Telephone  Trunking  Problem 

In  this  sect  ion  we  shall  discuss  the  many  server  queue  and  telephone  trunk¬ 
ing  problem  (infinite  server  queue)  with  Poisson  arrivals  and  exponential 
service  time.  As  usual  the  service  time.’  are  independent  of  the  arrival  process 
and  no  server  is  idle  if  a  customer  is  waiting.  If  there  are  n  servers  and 
we  let  *n(t)  denote  the  number  of  customers  waiting  or  being  serve*'  at  tie* 
t,  then  it  is  well  known  that  *n(t)  1*  *  birth  a?vd  death  process.  A*  such 


n 


the  transition  probabilities,  p^(t)  *  Pttn(t+r)  ■  j|Xn(r)  "  can  be 
expressed  by  the  integral  representation  of  Karlin  and  McGregor  (1958).  If  we 


knew  p^  (t) 


explicitly,  we  would  in  principle  be  able  to  calculate  all  the 


distributions  of  interest  for  this  model.  Unfortunately,  it  is  exceedingly 


difficult  to  compute  p„(t)  when  n  is  larger  than  3  or  4. 

Motivated  by  this  difficulty,  we  shall  seek  a  diffusion  approximation  for 
Xn(t)  when  n  is  large.  We  shall  follow  the  discussion  of  the  author 
(Iglehart  (1965)).  Naturally,  if  we  keep  the  expected  interarrival  time  and 
the  expected  service  time  fixed,  the  behavior  of  the  many  server  queue 
approaches  that  of  the  telephone  trunking  problem.  While  the  transition  prob¬ 
abilities  for  the  telephone  trunking  problem  are  well  known,  we  would  have  lost 
the  characteristics  of  the  original  problem.  In  fact,  the  traffic  intensity  p, 
defined  to  be  the  ratio  of  the  arrival  rate  to  n  times  the  service  rate,  would 


tend  to  zero  whereas  the  processes  being  approximated  have  a  positive  traffic 
intensity.  Therefore,  we  shall  let  the  arrival  rate  become  large  with  the  number 
of  servers,  and  keep  the  service  time  constant  in  such  a  manner  that  p  is 
maintained  at  a  fixed  value  less  than  one.  To  be  specific  we  let  xn(t)  be  the 
birth  and  death  process  with  parameters 


(n) 

X.  «  np 

i 


for  i  ■  0,1,...  and  n  -.1,2,..,,  where  0  <  p  <  1. 

As  a  heuristic  aid  to  setting  up  the  appropriate  approximating  processes, 
consider  the  imbedded  jump  process  of  Xn(t).  The  expected  displacement  in  one 
jump  starting  at  state  i  <  n  is  (np-i)/(np*i)  which  is  non-negative  if 
i  _<  np  and  negative  if  i  >  np.  For  i  >  n  the  expected  displacement  is  always 
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negative.  In  other  words,  the  state  [np]  is  an  "equilibriua  point"  of  the 
process.  Thus  for  our  approximating  processes  it  is  natural  to  consider  the 


fluctuations  of  Xn(t)  about  (np],  neasured  in  an  appropriate  scale.  We  shall 


Xft)  -  np 

Y  (t)  ■  ■  «/*■'“  ,  n  *  1,2, .. .  . 

(np)J/^ 

Again  as  a  guide  to  the  Uniting  diffusion  process,  we  shall  calculate  the 
infinitesimal  mean  and  variance  of  Yn(t).  The  infinitesimal  mean  is  defined  in 


terms  of  the  jump  process  ^(k),  say,  as 


■„(y)  ■  h{Yn(k*l)  -  ^(k)  j^U)  •  y)  /  ’’{holding  time  in  y}  . 


In  our  case  if  we  let  <*n(y)  *  [np  +  (np)'/  y],  then 

1  x(n)  .  u(») 

.  M _ _  SW  an<y)  ,.(n)  +  (n)  . 

"  ^171  \(y)  ■ 

01  «„(>)  \(y)  "  " 

We  now  take  n  so  large  that  “n(y)  <  n  (which  is  possible  since  p  <  1).  Thus 

1  1/2 

(10)  n  (y)  - - -  (np  -  [np  ♦  (np)  '  y)}  ♦  -y  as  n  -*>  •. 

(np)1'^ 

The  infinitesimal  variance  is  similarly  defined  as 

(11)  o2(y)  -  E{(Yn(k*l)  -  Yn(k))2|  ?n(k)  -  y)  /  E(holding  time  in  y)  . 


J  X(nJ  *  ♦  y(nJ 

2  °n(y)  «  (y) 

°n(y)  "  ~  ”(n 5 - 77n) 

(RP)  \w 


.  +  u(«)  ) 
Vy)  > 


“  (np  +  [np  ♦  (np)1/2y])  -►  2 


as  n 


Hence 
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The  liait  process  should  then  be  governed  by  the  backward  equation 


3u 

3t 


*2» 

I? 


•  <  x  <  •  , 


which  is  recognized  as  the  equation  of  the  Ornstein-Uhlenbeck  diffusion  process. 

At  this  point  we  are  in  need  of  a  technique  for  showing  that  the  distribution 
of  Yn(t)  converges  to  the  distribution  of  the  Ornstein-Uhlenbeck  process 
Y(t),  say.  If  we  could  explicitly  obtain  the  representation  for  p^ft),  we 
night  use  analytic  techniques  involving  orthogonal  polynomial  to  achieve  our 
result.  While  the  parameters  (ir. }  and  the  orthogonal  polynomials  (Q. (x) } 

J  1 

can  be  easily  obtained,  the  measure  <Kx)  is  difficult  to  characterize.  Ir  fact, 
the  difficulty  in  obtaining  this  representation  provided  our  initial  motivation 
for  looking  for  a  diffusion  approximation.  Fortunately,  there  is  a  general 
theory  due  to  Stone  (1961,  1963)  which  gives  necessary  and  sufficient  conditions 
for  the  weak  convergence  of  a  sequence  of  birth  and  death  processes  (or  random 
walks,  or  diffusions)  to  a  limiting  diffusion  process.  Although  the  general 
set-up  and  notation  required  to  discuss  Stone's  results  in  detail  are  too  involved 
for  this  paper,  perhaps  a  few  heuristic  remarks  would  be  helpful. 

The  processes  considered  by  Stone  (random  walks,  birth  and  death  processes, 
and  diffusions)  enjoy  the  property  that  points  in  the  state  space  are  not  jumped 
over  by  the  process;  i.e.,  possible  transitions  can  only  occur  to  neighboring 
states  in  the  discrete  case  and  the  path  functions  are  continuous  in  the  contin¬ 
uous  state  space  case.  This  property  remits  in  the  infinitesimal  operator  of  the 
semi-group  of  transition  functions  being  -  'ocal  operator.  The  infinitesims 
operator  is  essential 1/  determined  (aside  from  b.iy  boundary  conditions  which  may 
have  to  be  imposed)  by  the  infinitesimal  mean  and  variance  of  the  process.  Since 
convergence  of  the  infinitesimal  operators  of  a  sequence  of  processes  implies 
convergence  of  the  processes,  it  seems  natural  to  assume  that  convergence  of  the 
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infinitesimal  means  and  variances  would  also  imply  convergence  of  the  processes. 
This  is  in  fact  true,  except  the  infinitesimal  operator  of  a  semi-group  is  not 
determined  until  the  boundary  conditions  (lateral  conditional  in  Feller's  (1957) 
terminology)  have  been  specified.  Hence  to  obtain  convergence  of  the  processes 
we  need  to  assume  that  the  behavior  of  the  sequence  of  processes  in  the  neighbor¬ 
hood  of  a  boundary  point  convergence  to  that  of  the  limit  process.  Finally,  we 
need  to  check  that  the  state  space  of  the  approximating  processes  becomes  dense 
in  the  state  space  of  the  limit  process. 

In  our  example  of  the  many  server  queue  we  have  shown  in  (10)  and  (11)  that 

the  infinitesimal  mean  and  variance  convergence  uniformly  in  every  compact  interval 

of  the  line.  Furthermore,  the  state  space  of  Yn(t)  becomes  dense  in  (-•,  ♦•), 

the  state  space  of  the  limit  process,  as  n  -►  ■>.  These  are  the  essential  facts 

required  to  apply  Stone's  results  (cf.  Iglehart  (1965)).  The  conclusion  is  that 

1/2 

the  processes  Yn(t)  converge  weakly  to  Y(t)  provided  Xn(0)  -  [np  ♦  (np)  y] 
for  any  real  y. 

To  illustrate  how  the  representation  for  p^(t)  can  be  used  to  obtain  a 
limiting  diffusion,  we  shall  consider  the  telephone  trunking  problem.  In  this 
model  *n(t),  the  number  of  busy  channels,  is  a  birth  and  death  process  with 
parameters 


(n)  * 

nj  ■  J 

for  j  ■  0,1,...,  and  n  ■  1,2,...,  where  c  >  0.  The  representation  (Karlin 


and  McGregor  (1958))  in  this  case  is 


•  -nc,  .k 

Pi  j  (t)  •  J  e“Ktci(k;nx)Cj(k;nc)  , 


where  u^(x;a)>  are  the  Poisson-Charlier  polynomials,  orthogonal  with  respect 


I  *4*4 
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v  4 

to  the  measure  e  a  /x'  on  x  »  0,1,...  and  ■  (nc)J/j!.  By  showing  that 
the  Poisson-Charlier  polynomials,  properly  normalized,  converge  to  the  Hermite 
polynomials  (this  is  not  surprising  considering  the  fact  that  the  Hermite  poly¬ 
nomials  are  orthogonal  with  respect  to  the  normal  density  and  the  implications  of 

the  central  limit  theorem) ,  it  is  not  hard  to  show  that  for 
1/2 

an(x)  ■  [(nc)  x  ♦  nc3,Pa  a  (y^ (t)  is  asymptotic  (when  normalized) 

to  the  transition  density  for  the  Omstein-Uhlenbeck  process.  This  convergence 

provides  a  local  limit  theorem  for  the  convergence  of  the  processes 
1/2 

(Xn(t)  -  nc)/(nc)  '  to  the  Omstein-Uhlenbeck  process. 

For  another  example  of  such  a  local  limit  theorem  obtained  from  the  integral 
representation  for  p^(t)  in  the  case  of  the  Ehrenfest  urn  model,  the  reader 
should  consult  Karlin  and  McGregor  (1965). 

5.  Luchak's  Queueing  Model  in  Heavy  Traffic 

Luchak  (1958)  studied  a  single-server  queueing  model  in  which  customers 
arrive  according  to  a  Poisson  process  with  rate  X  >  0  and  require  a  random 
number,  N,  of  phases  of  service,  each  phase  being  exponential  with  parameter 
u.  Luchak  considered  the  transient  behavior  of  the  phase  length  process,  Q(t), 
which  is  the  number  of  phases  present  in  the  system  at  time  t  including  the 
phase  in  service.  His  result  is  given  in  terms  of  a  rather  unwieldy  transform. 

Recently  Lalchandani  (1967)  considered  in  his  thesis  the  behavior  of  Q(t) 
in  heavy  traffic  (i.e.,  as  the  traffic  intensity  approaches  one).  Me  shall  give 
a  brief  outline  of  his  method  which  seems  to  have  some  general  interest. 

The  distribution  of  N  is  discrete  (P(N»nj  •  cfl,  n«l,2,...)  with  finite 
mean,  a,  and  second  moment,  b.  With  these  parameters  the  traffic  intensity 
p  -  Xa/u.  Consider  now  a  sequence  of  queueing  systems  indexed  by  n  (n>l)  for 
which  the  arrival  and  service  parameters  are  Xn  and  pn>  For  the  corresponding 
traffic  intensity,  pn,  to  tend  to  1  we  choose,  as  an  example,  Xn  ■  n  and 
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u  ■  a(n  +  /n).  If  0  't)  denotes  the  phase  length  process  for  the  n**1  system, 

n  i* 

Lalchandani  shows  that  the  distribution  of  Xn(t),  defined  as 


x„(«) 


[(a*b)n]1/2 


converges  as  n  »  to  the  distribution  of  Brownian  motion  at  time  t  with 
initial  state  y,  provided  0^(0)  ■  [  /(a+b)n  •  y  ♦  n]. 

While  the  Qn(t)  process  is  a  continuous  parameter  Markov  chain,  it  is  not 
a  birth  and  death  process  and  hence  we  do  not  have  available  the  special  tech¬ 
niques  for  such  processes.  Consider,  however,  the  discrete  parameter  jump  chain 
(^(k),  corresponding  to  (^(t).  In  one  step  the  chain  (^(k)  goes  up  j  with 
probability  c^An  /  (A^  ♦  un)  and  down  1  with  probability  Pn/(*n  *  yn),  provided 

(^(k)  >  0.  If  ^(k)  *  0,  then  the  only  transitions  are  up  j  with  probability 

\ 

Cj.  Except  for  the  troublesome  state  0,  the  (^(k)  process  is  a  process  of  sums 

of  independent  random  variables.  However,  when  pn/l,  we  would  not  expect 

the  queue  to  be  idle  often,  and  thus  it  is  not  too  surprising  that  the  limit 
% 

process  for  ( [ A^  ♦  un)t])  is  Brownian  motion,  where 

5ln(k)  ■  (<^(k)  -  nj/Ka+bjn]1^.  One  of  the  crucial  steps  in  showing  this  con¬ 
vergence  in  distribution  is  the  study  of  the  behavior  of  Ptfj^Cj)  ■  0). 

Once  the  convergence  of  the  suitably  translated  and  scaled  jump  chain  is 
obtained  it  is  not  difficult  to  show  that  the  Xn(t)  process  converges.  If 
we  consider  the  tine  points  at  which  jumps  in  the  (^(t)  process  occur,  it  is 
clear  that  these  time  points  are  essentially  a  renewal  process  with  exponential 
lifetimes  having  parameter  (AR  ♦  »n).  Very  occasionally,  however,  the  queue  is 
empty  and  the  exponential  parameter  is  An.  Define  Nn(t)  to  be  the  number  of 
jumps  of  the  (^C*)  process  in  the  interval  [0,t].  Then  the  mean  and  variance 
of  N  (t)  are  shown  to  behave  like  (A_  ♦  y  It  as  n  •.  Now  a  conditional 
probability  argument  can  be  used  in  which  the  P{Xn(t)  <,  x|N^(t)  »  j)  is 
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is  identified  with  the  P{Xn(j)  <,  x},  Finally,  in  this  manner  Lalchandani  shows 
that  the  distribution  of  Xn(t)  con'ftrjr^s  to  the  appropriate  normal  distribution. 

While  the  argument  outlined  above  used  the  specific  structure  of  the  Harkov 
chain  (^(t),  it  seems  likely  that  the  idea  of  looking  first  for  limit  processes 
of  the  jump  chain  should  have  greater  applicability. 

6.  The  Multi-urn  Ehrenfest  Model 

In  the  multi-urn  Ehrenfest  model  N  balls  are  distributed  among  d+1  (d>2) 
urns.  If  we  label  the  urns  0,1,..., d,  then  the  system  is  said  to  be  in  state 

4 

^  -  (ij,ij, . » . ,id)  when  there  are  i^  balls  in  urn  j  (j«l,2, . . . ,d)  and 
balls  in  urn  0.  At  discrete  epochs  a  ball  is  chosen  at  random  from  one  of  the 
d+1  urns;  each  of  the  N  balls  has  probability  1/N  of  being  selected.  The  ball 
chosen  is  removed  from  its  urn  and  placed  in  urn  i  (i*0,l, . . . ,d)  with  prob¬ 
ability  p*,  where  the  p^'s  are  elements  of  a  given  vector,  (p°,]j),  satis- 

i  d  i 

fying  p  >  0  and  l  p  *  1.  We  shall  let  ^.(k)  denote  the  state  of  the 

i-0 

system  after  the  kth  such  rearrangement  of  balls.  In  this  section  we  shall 
discuss  some  limit  theorems,  which  were  obtained  by  the  author  (Iglehart  (1967)) 
for  the  sequence  of  processes  {^(k):  k«0,..,,N)  as  N  tends  to  infinity. 

For  the  classical  Ehrenfest  model  (d«l,  p^»p*«  1/2)  Kac  (1947)  showed  that  the 
distribution  of  (^((NtJ)  -  N/2)/(N/2)^2  converges  as  N  -*■  •  to  the  distribu¬ 
tion  of  the  Omstein-Uhlenbeck  process  at  time  t  having  started  at  y ^  at 
t»0,  provided  ^(0)  •  [(N/2)^2yQ+N/2].  Recently,  Karlin  and  McGregor  (1965) 
obtained  a  similar  result  for  the  continuous  time  version  of  the  model  with  d»2; 
in  this  version  the  random  selection  of  balls  is  done  at  the  occurrence  of  events 
of  an  independent  Poisson  process. 

4.  The  vector  ^  has  all  its  components  equal  to  1  and  is  the  usual 

scalar  product. 
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A  preliminary  calculation  indicates  that  the  process  (JC^k):  k»0,...,N) 
is  attracted  to  the  pseudo-equilibriua  state  Ng  and  that  states  far  froa  Njj 
will  only  occur  rarely.  Thus  it  is  natural  to  consider  the  fluctuations  of 
^(k)  about  measured  in  an  appropriate  scale.  For  our  purposes  the  appro¬ 
priate  processes  to  consider  are  {^(k) :  k»0,...,N},  where 

^(k)  -  (^(k)  -  Njr,)/N1/2. 

Next  we  define  a  sequence  of  stochastic  processes  {^(t):  0  <  t  <  1}  which 
are  continuous,  linear  on  the  intervals  ((k-l)N"*,  kN"*),  and  satisfy 
^(kN*1)  «  ^(k)  for  k*0,l, . . . ,N.  In  other  words  we  let 

fo(t)  «  ♦  (Nt-k) (^(k+1)  -  ^(k)) 

if  kN"1  <  t  <  (k+l)N_1.  Throughout  this  discussion  we  shall  let5 

j£(0)  «  [N1/2yJ  ♦  Np1],  where  ^  -  (yj,...,yjj)  is  an  arbitrary,  but  fixed, 

eleaent  of6  R^.  With  this  initial  condition  and  the  Markov  structure  of  the 
model,  the  processes  {^(k):  k«0,...,N}  for  N»l,2,...  can  be  defined  on  a 
probability  triple  PN)«  shall  let  Cd[0,lj  denote  the  product 

space  of  d  copies  of  C[0,1],  the  space  of  continuous  functions  on  [0,1] 

with  the  topology  of  unifora  convergence,  and  endow  (^[0,1]  with  the  product 

topology.  The  topological  Borel  field  of  C^[0,1]  will  be  denoted  by 
Clearly,  the  transformation  taking  the  sequence  {^(k):  k«0,,..,N}  into 
{^(t):  0  <  t  <  1)  is  aeasurable  and  induces  a  probability  aeasure  on  Me 

shall  denote  this  induced  aeasure  by 

5.  It  will  always  be  understood  that  N  is  sufficiently  large  so  that 

0  <  X^(0)  <_  N  for  all  i«l,2,...,d,  where  Xj^(*)  is  the  i^-th  component  of 
the  vector 

6.  Rd  is  d-diaensional  Euclidean  space. 
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The  principal  result  of  Iglehart  (1967)  is  that  u^(*  ;Xq)^  v(* !Yq)  *s 
N  ♦  «t  where  v( » ;yQ)  is  the  probability  Measure  on  of  a  d-diaensional 
diffusion  process,  y(*)»  starting  at  the  point  yn.  The  liait  process  y(-)  i» 

\  ^  v 

a  d-diaensional  analog  of  the  Ornstein-Uhlenbeck  process  whose  distribution  at 
tiae  t  is  a  aulti-variate  noraal  with  aean  vector  e’*^  and  covariance  aatrix 
£,  where  the  eleaents  of  £  are 

^  (l-e’2t)?1(l-pi)  •  i'J 

°ii  *  " 

-2t  i  i 

-(1-e  “)plp  ,  Wj. 

V. 

To  obtain  the  weak  convergence  of  the  Measures  to  w  we  aust  first 
show  the  convergence  of  the  corresponding  f.d.d.  He  shall  only  be  able  to 
sketch  the  proof  here.  For  the  convergence  of  the  distribution  of  yu(t)  we 
can  consider  the  distribution  of  ^((Nt]),  since  |Y*([Nt])  -  y^ ft) | 
for  ial , . . . ,d  with  probability  one.  The  aethod  of  characteristic  functions 
and  the  Levy  continuity  theorea  is  used.  If  we  let7  +N(g;k)  ■  Ej^expU^^fk)}}, 
then  by  using  a  standard  conditional  probability  arguaent  and  obvious  asymptotic 
expansions  we  show  that 

for  k-0,1, . . . ,N-1,  where 

IhC*)  •  •*!>*-  N’V&  4  • 

*  -  (l-e'21)’1  l. 

^(£,1)  •  (1-N'1  ♦  ©(h”1))^  as  N  ♦  and  the  terms  ©(N”1)  are  unifora 
for  i  in  a  compact  set  of  Rd  and  independent  of  k.  Froa  this  result,  a 
siaple  iteration  one  shows  that 


7.  The  syabol  denotes  expectation  with  respact  to  F^. 
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k-1 

(12)  *N(*,k)  -  n 

j-0 

for  k-l,2,...,N,  where  Jj^.p)  ■  l  *nd  for  )  >  1. 

Now  letting  k  ■  [Nt]  in  (12)  taking  logarithms  we  obtain 

lim  ta*N($,[Nt])  -  -(1/2)  l'  l  l*  • 

N-*»  * 

which  is  the  characteristic  function  of  £(t).  The  convergence  of  the  f.d.d.  is 
shown  by  a  similar  argument.  To  complete  the  proof  of  weak  convergence  a  combi¬ 
nation  of  the  methods  of  Stone  (1961)  and  Billingsley  (19S6)  are  used. 

The  method  outlined  above  to  show  convergence  of  the  f.d.d.  can  be  used  for 
a  variety  of  related  urn  models,  some  of  which  are  associated  with  queueing 
problems.  These  results  will  appear  in  future  publications. 

7.  Weak  Convergence  of  a  Sequence  of  Quickest  Detection  Problems 

Consider  a  production  process  which  is  in  one  of  two  states,  a  good  state 
and  a  bad  state,  which  correspond  to  being  in  control  and  out  of  control.  Pro¬ 
duction  begins  with  the  process  in  control  and  after  each  item  is  produced  there 
is  a  probability  v  of  the  process  going  out  of  control.  A  statistical  control 
procedure  is  desired  which  will  enable  one  to  detect  the  fact  that  the  process  is 
out  of  control  in  some  optimal  manner.  This  model  of  a  production  process  was 
first  introduced  by  Cirshick  and  Rubin  (1952)  and  later  discussions  of  the 
problem  are  due  to  Shiryaev  (1963),  Taylor  (1967),  and  Bather  (1967).  Most  of 
the  work  carried  out  in  these  papers  deals  with  a  continuous  time  analog  in 
which  a  Browniaa  motion  process  with  mean  0  has  a  drift  of  1  introduced  after 
some  independent  exponential  tine.  The  correspomding  optimal  statistical  control 
procedures  are  then  derived  and  proposed  as  good  rules  for  controlling  the 
discrete  processes  The  passage  from  discrete  to  continuous  and  back  to  discrete 
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hat  never  been  carried  out  in  a  rigorous  way.  It  turns  out  that  the  notions  of 

weak  convergence  are  exactly  what  is  required. 

Our  discussion,  based  on  the  paper  Iglehart  and  Taylor  (1967),  begins  with 

a  sequence  of  truncated  processes  which  can  be  easily  described  as  follows.  In 

the  truncated  p rob lea  of  length  n  (>2)  the  process  produces  n  independent 

iteas  and  goes  out  of  control  at  a  random  tine  Tn  (<p).  All  iteas  produced  at 

or  before  tiae  Tr  are  assumed  to  have  a  random  quality  with  distribution 

function  Fq  (having  mean  0  and  variance  1).  All  iters  produced  after  Tr 

possess  quality  given  by  Fj(x)  •  Fq(x-  1) ;  i.e.,  the  process  when  out  of 

control  shifts  the  d.f.  F  by  1  unit.  The  distribution  of  T  is  given  by 

o  n 

(1  -  «J3  1  .  j  -  l,2....,n-l 

(1  -  ,  3  -  n  , 

which  is  si^>ly  a  geometric  distribution  with  paraaeter  */n  truncated  at  n-1 
with  the  remaining  mass  lumped  at  j«n.  If  Tr  ■  n.  then  all  n  items  pro¬ 
duced  have  quality  given  by  Fq.  The  process  is  just  turned  off  after  n  iteas 
have  been  produced  and  if  Tr  ■  n  it  would  be  out  of  control,  but  this  in  then 
irrelevant. 

He  introduce  two  sequences  of  measures  on  0  (Borel  sets  of  C(0,l))  as 

follows.  Let  Xj,X2,...  be  a  sequence  of  independent  randoa  variables  with  d.f. 

F„  and  define 
o 

•  (  l 

i«i  1 

for  k  ■  1,2 . n  and  Xn(0)  •  0.  Then  the  paths  ^(t)  in  C[0,1]  are 

obtained  by  setting  x^t)  ■  (k)  for  t  *  k/n,  k  •  0,1 . n  and  by  linear 

interpolation  for  other  values  of  t.  Let  denote  the  measure  induced  on 
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by  *n(-}«  Next  define  the  continuous  paths  (®n(‘)  on  £0 » 1 1  by 


v> 


(«-T„/n) 


‘  i  V" 


1  ’  V» 


The  Measures  induced  by  0^(‘)  w*  denote  by  A^  r' 
of  production  corresponds  to  yn(t)  *  xn(t)  *  *n(0* 


'.early,  the  obsei«cu  process 


Froa  the  work  of  Prokhorov  (1956)  we  know  that  i*  u ,  where  u  is 

n 

Wiener  Measure  for  paths  starting  at  0.  We  now  introduce  the  Measure  A  on 
induced  by  the  process  8(*)  on  (0,1)  defin  d  by 


®(t) 


(t-T) 


t  <T 

t  >  T 


where  the  randoa  variable  T  has  an  exponential  density  fT(t)  with  paraaeter  * 
for  0  <  t  <  1  and  assuaes  the  value  1  with  probability  e~V  Using  character¬ 
istic  functions  it  is  easy  to  show  that  the  f.d.d.  of  Afl  converge  to  those 
of  X.  Furtheraore,  since  the  support  of  all  the  Measures  { }  is  contained 
in  the  coapact  set  M  CC{0,1)  given  by 


H  •  {x:  x(t)*0,  t«t  ;  x(t)«t-t  ,  t>t  ;  for  soa*  t  c(0,l))  , 


it  follows  froa  Prokhorov  (1956)  that  the  faaily  of  Measures  is  tight  and  thus 

A_  x.  Finally,  since  the  aeesures  induced  by  y  ft)  are  siaply  the  convolu- 
n  « 

tion  X_  •  u  ,  and  additional  argument  shows  that  X_  •  w  X  •  u. 
n  n  n  n 

Girshick  and  Rubin  (1952)  show  that  the  optiaal  contvl  (under  a  cost 
structure  which  we  won't  aention)  is  to  stop  the  process  when  the  posteriori 
probability  that  the  process  will  be  out-of-control  for  the  next  itea  produced. 
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given  the  history  of  observations,  exceeds  a  certain  level.  It  is  Bore  conven¬ 
ient  to  consider  a  monotone  function  of  this  posteriori  probability  which  saps 
each  path  of  the  process  {yn(t):  0  £  t  <.  1}  into  C[0,1],  This  sequence  of 
mappings  (one  for  each  n)  is  continuous  and  converges  uniformly  on  compact 
sets  of  C[0,1].  Another  result  of  Prokhorov  (1956)  implies  that  the  measures 
induced  by  these  monotone  ^unctions  converge  weakly.  Finally,  this  last  result 
implies  that  the  distribution  .  '  the  optimal  stopping  times  and  the  optimal 
costs  converge.  Thus  we  have  established  in  a  rigorous  manner  the  relationship 
between  the  discrete  models  and  the  continuous  analog. 

8.  Other  Work  on  Convergence  of  Processes  in  Applied  Probability 

In  this  final  section  we  shall  mention  briefly  some  other  work  on  conver¬ 
gence  of  processes.  The  area  of  applied  probability  in  which  diffusion 
approximations  have  been  most  widely  used  is  population  genetics.  This  work 
was  initiated  by  Fi'her  and  Wright  in  the  1930’s.  We  have  not  discussed  any 
of  these  applications  since  a  comprehensive  review  is  available  by  Kimura  (1964). 
A  subsequent  paper  which  treats  diffusion  approximations  in  genetics  from  the 
point  of  view  discussed  here  is  Karlin  and  McGregor  (1964). 

In  branching  processes  several  papers  have  recently  appeared  which  deal 
with  convergence  of  processes.  These  papers  are  Lamperti  (196?),  Lamperti  and 
Ney  (1967),  and  Lamperti  (1967). 

Distribution-free  statistics  such  as  those  of  the  Kolmogorov-Smirnov  and 
Cramer-von-Mises  types  can  be  defined  as  functionals  on  the  sequence  of  empiri¬ 
cal  stochastic  processes.  Convergence  of  these  processes  has  been  studied  by 
joob,  Donsker,  and  Prokhorov.  For  an  excellent  summary  of  this  work,  complete 
references,  and  further  extensions  the  reader  should  consult  Pyke  (1965).  Two 
additional  papers  which  deal  with  order  statistics  and  related  random  variables 
are  Dwass  (1964)  and  Lamperti  (1964). 
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1-  Introduction. 

Certain  techniques  vhich  vere  developed  for  sequential  analysis  have 
been  found  to  apply  to  a  vide  variety  of  problems  including  some  stochastic 
control  problems.  We  shall  outline  these  techniques  and  indicate  how 
they  may  be  applied.  The  main  tool  is  the  representation  of  Incoming 
information  in  terms  of  continuous  time  Wiener  process  vhich  relates  these 
problems  to  the  solution  of  1 ree  Boundary  Problems  involving  the  heat 
equation. 

2.  Some  Problems. 

Consider  the  following  distinct  problems. 

Problem  1.  (A  Sequential  Analysis  Problem).  The  random  variable  x  is 

2/  2 
normally—  distributed  with  unknown  mean  p  and  known  variance  cr  .  The 

i  / 

-  Prepared  with  the  partial  support  of  NSF  Grant  GP  570S  An 
amplified  version  of  this  paper  will  be  submitted  to  Sankhya . 

“  f 

—  We  shall  use  the  fg1 lowing  notation  throughout  the  paper.  Let 

:p(x)  =  (2n 2e"x  and  $>(x)  *  ®(y)dy  to  represent  the 

standard  normal  density  and  cumulative  distribution  functionss.  Tr.e 

2  o. 

normal  distribution  with  mean  u  and  variance  a  is  7((p, c  ) 

2  -1 

and  has  density  n(x;u,cr  )  -  cr  fcp(x-a tyc.1  and  cdf  $[(x-u)/c]. 

We  denote  the  probability  distribution  (lav)  of  a  random  variable 
X  by  £{y)  and  its  mathematical  expectation  by  £(X).  £(XjYj 
and  £( X  \x  ,  represent  the  conditional  distribution  and  expectation 
of  X  given  Y 
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statistician  must  decide  whether  p  >  0  or  4  <  0  and  the  cost  of  an 
incorrect  decision  is  k|p|,  k  >  0.  He  is  permitted  to  sample  sequentially 
(one  observation  at  a  time)  at  a  cost  of  c  per  observation.  He 
may  stop  sampling  at  any  time  and  make  a  decision.  The  total  cost  will 
be  cn  if  the  decision  is  correct  and  cn  +  k|p|  if  it  is  wrong  where 
n  is  the  number  of  observations  taken.  The  cost  is  a  random  variable 
whose  distribution  depends  on  the  unknown  u  and  the  (sequential) 
procedure  used. 

To  determine  an  optimal  procedure  one  must  specify  some  criterion 
of  optimality.  It  is  convenient  to  treat  this  problem  in  a  Bayesian 
context  assuming  that  the  unknown  u  is  normally  distributed  with  mean 
and  variance  (both  known).  Then  a  sequential  procedure  deter¬ 

mines  an  expected  cost,  and  one  may  seek  that  procedure  which  minimizes 
the  expected  cost. 

Problem  2.  (a  stopping  Problem;.  Let  {X  ,  -m  <  n  <  01  be  a  stochastic 

-  n  —  — 

proceso  such  that  X  -  x  is  specified  and  X  ,,  =  X  +u  where  the 

-m  n+l  n  n 

u .  are  Independently  and  normally  distributed  with  mean  0  and  variance 
1.  Thus  it  Ls  convenient  to  think  of  Xn  changing  as  the  subscript  n 
increases  to  zero.  For  each  n  <  0,  an  observer  can  stop  the  process 
and  collect  or  he  can  wait  till  n  -  0  at  which  point  he  collects 

0  .f  Xq  >  snd  Xq  if  Xq  <  C  However,  he  must  pay  1  for  each 

ob-erv-jt  ion.  What  constitutes  an  optimal  procedure  for  stopping? 

Problem  1-  (Pocket  Control-Infinite  Fuel).  A  rocket  is  directed  toward 
At  certain  time  points  ft^l,  instruments  make  measurements 
e~i  im-it  (ng  tr.f  distance  by  which  it  will  miss.  The  miss  distance  can 


then  be  adjusted  by  an  "instantaneous"  use  of  fuel.  The  cost  of  missing 

2 

by  an  amount  y  is  ky  .  The  cost  per  unit  of  fuel  is  c.  As  much 
fuel  as  is  desired  is  available.  How  should  fuel  be  allocated  to  minimise 
expected  total  cost? 

Certain  simplifications  and  specifications  are  necessary  to  make 
this  problem  meaningful  and  tractable.  The  assumption  of  an  infinite 
supply  of  available  fuel  is  already  such  a  simplifying  assumption.  Let 
us  replace  the  natural  two-dimensional  miss  vector  by  a  one  dimensional 
value  which  can  take  on  positive  and  negative  values.  Let  us  assume  that 
if  an  amount  of  fuel  A  is  used  at  time  t  the  miss  distance  is  changed 
by  ±e(t)A  where  e(t)  represents  the  efficiency  of  fuel  at  time  t. 

Since  fuel  is  used  to  change  direction,  the  efficiency  e(t)  is  greatest 
at  the  beginning  of  the  flight  when  the  rocket  is  far  away  from  the  target. 
We  shall  assume  that  the  amount  of  fuel  used  and  its  effect  can  be  con¬ 
trolled  and  measured  with  precision.  Let  us  now  assume  that  the  mea¬ 
surements  estimating  the  miss  distance  are  at  time  t^  where  the 
x^  are  independent  and  normally  distributed  with  mean  equal  to  the  miss 

distance  (provided  no  additional  fuel  is  used)  and  known  variance  equal 

2 

to  <7^. 

Here  as  in  the  Sequential  Analysis  problem,  the  performance 
characteristics  of  a  procedure  depend  on  the  unknown  value  of  a  funda¬ 
mental  parameter.  In  the  Sequential  Analysis  problem  that  was  u.  Hew 
it  is  the  unknown  miss  distance  that  would  be  obtained  if  no  adjustments 
were  made.  As  in  the  sequential  analysis  problem  ve  find  it  convenient 

to  assume  that  the  unknown  miss  distance  has  a  normal  prior  distribution 

2 

with  specified  mean  and  variance  oq.  Then  there  is  an  expected 
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cost  for  each  procedure  and  the  problem  of  selecting  an  optimal  procedure 
is  meaningful. 

These  three  problems  have  more  flavor  if  the  competing  factors  are 
indicated  qualitatively.  In  the  sequential  problem,  after  much  data 
has  been  accumulated  one  is  either  reasonably  certain  of  the  sign  of  11 
or  that  ||ij  is  so  small  that  the  loss  of  deciding  wrong  is  less  than 
the  cost  of  another  observation.  Here  one  expects  the  proper  procedure 
to  be  such  that  one  stops  and  makes  a  decision  when  the  current  estimate 
of  Itij  is  sufficiently  large  and  continues  sampling  otherwise.  What 
constitutes  sufficiently  large  depends  on,  and  should  decrease  with,  the 
number  of  observations  or  equivalently  the  precision  of  the  estimate.  It 
can  be  shown  that  after  a  certain  sample  size  it  pays  to  stop  no  matter 
what  the  current  estimate  of  j^‘  is. 

In  the  f topping  Problem,  it  is  clear  that  if  is  sufficiently 
negative  (depending  or  n)  one  ought  to  pay  the  cost  of  continuing  one 
more  step.  It  is  to  be  expected  that  there  are  limits  yR  so  that  for 
Xfl  <  yn>  it  pays  to  continue  and  for  >  yn  it  pays  to  stop. 

In  the  rocket  problem,  -hen  the  rocket  is  close  to  target,  fuel 
efficiency  may  t*»  so  low  that  even  through  the  nut  distance  is  practically 
known  one  would  not  •u*'  *  :r*.;es&  the  mu  'distance- were  very  great. 

When  the  rocket  if  fir  from  the  target  fuel-  is  effective  but  the  •lea 
distance  is  not  veil  known  and  so  one  U  reluctant  to  make  an  adjustment 
for  fear  of  overshooting  or  adjuring  in  the  wrong  direction.  Here  one 
should  expect  the  solution  to  have  the  following  property.  There  arc 
limits,  y,  s?  *  ime  .  t  such  that  \*  the  ear  lasted  miss  distance  exceeds 

si  .* 

y,  in  absolute  value,  et-iT.gh  fuel  U  .-  <e«!  to  take  the  adjusted  estimate 


to  iy^ .  It  seems  reasonable  to  e/pect  the  y,  to  be  large  at  the 
beginning  of  flight  and  at  the  end  of  the  fm  and  relatively  snail 
in  between- 

For  specific  values  of  the  constants,  ail  three  of  these  problems 
can  be  solved  numerically  by  the  backward  induction  techniques  of  dynamic 
programming.  For  the  sequential  analysis  problem  care  muct  be  taken 
to  initiate  the  backward  induction  at  a  sample  size  n  sufficiently  large 
so  that  no  matter  what  the  estimate  of  *  is,  the  optimal  procedure  will 
lead  to  a  decision  rather  than  to  additional  sampling.  The  technique  of 
the  backward  induction  can  te  summarized  by  the  equation 


(2.1 ) 


o  (  r  *  inf  e  L-  ,  [t  ,U  .« '))} 
n  n  .  n~*.  n*i  n  n 


where  ry't^)  is  the  expected  cost  of  an  optimal  procedure  given  the 
history  I  up  to  stage  n,  I  (a  )  describes  the  history  up  to 
stage  n*i  which  may  be  random  with  distribution  depending  on  $n  and 

the  act  ior-  a  taken  at  at  age  n.  It  is  possible  to  show  that  lr.  the 

seq^’-ht jai  antiiyv;?  and  rocket  oatroi  problems.  {  ii  adequately 
summarised  by  the  mean  and  variance  of  the  pros  ter  i  or  dl*tnbut  ion  of  u 
while  in  the  stopping  prot.e-  my  be  weed  to  describe  {  .  The 
minimizing  ?>n  which  depend  on  *  determine  the  optimal  procedure. 

r?  .-ust  rate,  we  n.ne  that.  f  ?r  the  stopping  problem  n  (x  *  «  m 

for  4  >  and  ej*  m-x4  f-se  x  v  Unee  reaching  n  *  0  implies 
a  p^-ment  of  s  for  uosi*  ,n-.r.g  m  **•;  At  n  *  *l»  the  choice  of 
i.'opptrg  >■  >:  »  x  •  '  s  :c of  e  • ,  ! >  the  choice  of 


Thus 


n 


O  -X 


P_XU)  =  minf  (m-1). 


m- 


(x+u/^<p(u)duj 


and  the  best  action  for  X  ,  =  x  is  to  stop  or  continue  depending  on 
which  of  the  two  terms  in  the  brackets  is  smaller.  Having  evaluated 
p_^(x),  one  can  in  principle  preweed  in  the  same  way  to  obtain  p  g 
and  optimal  decision  for  n  =  -2  as  a  function  of  X_~,  etc. 

The  rocket  and  sequential  analysis  problem  seem  more  complex  in 
that  they  involve  the  posterior  distributions,  but  the  calculus  of 
posterior  distributions  (to  be  discussed  in  Section  6)  when  dealing  with 
normal  random  variables  and  normal  priors  permits  these  problems  to  be 
treated  with  equal  facility. 

In  a  sense  then,  these  problems:  arc  trivial-  If,  however,  it  is 
desired  to  derive  some  overall  view  of  now  the  solutions  depend  on  the 
various  parameters,  the  simple  though  extensive  numerical  calculations 
of  the  backward  induct  ton  are  no*  *dei.;a*e 

Ar  approver,  which  t>  iae  f  ir*,  icu.ar  relev.net  to  large  sample 

theory  is  that  of  replacing  tae  discrete  ’  im  random  vnr  moles  by 
analogous  eon*  ;n>oug  tire  ;  *  .***.»*•  sc  pr-xe-ts*;-  : nr  use  of  th*  Wiener 
process  seem*  e-pec ’ally  r*>  .ev*n*  and  **rve#  to  convert  the  problem  to 
one  in  which  the  analytic  methods  of  partial  differential  equation?  can 
serve  f**u;*f 

5.  Wiener  Process- 

.* ipp<  e  x-,x^,..-»x  ,  are  independently  and  identically  distributed 
with  ttem  »  and  hariarc*  #*' .  let  *  x,  *x+  ■  *  *  *x  .  Then  for 

n  1.  we  n 


n  >  m  >  0,  Xn-Xm  is  independent  of  X1#X2, • • • »Xm,  and  normally  distri- 

buted  with  mean  (r.-mjp  and  variance  (n-nt)c/  •  When  n  is  large,  a 

graph  of  the  discrete  process  resembles  the  anaiagous,  continuous 

time  process  (X(t):  C  <  t)  which  has  the  following  properties.  The 

function  X(t)  is  continuous,  X{0)  =  0,  and  for  0  <  t^  <  tg, 

X(t2)-X(ti)  is  independent  of  (X(t)r  0  <  t  <  t^}  and  is  normally 

distributed  with  mean  and  variance  a  (tg-t^).  This  maybe 

referred  to  as  a  Gaussian  process  with  independent  increments  or  as  a 

2 

Wiener  process  with  drift  ^  and  variance  a  per  unit  time.  Typically 

the  drift  is  not  referred  to  when  we  consider  the  case  p  »  0.  There 

is  a  trivial  but  occasionally  useful  variation  where  X(t)  is  initiated 

at  the  point  X(t  )  *  x  rather  than  at  X(0)  *  0.  Notationaily  it  is 
o  o 

convenient  to  refer  to  the  process  by  the  equations 


(M 


EldXlt.’i  *  wdt 
Var(dX(t}l  -  o^dt 


This  is  especially  convenient  for  variations  of  the  Wiener  process  which 
are  derived  by  changing  the  scales-  Observe  that  if  E!dX(t))  »  0  end 
Van.'dXtt )'  *  dt.  the  transformation  t*  •  avt,  X*it* )  ■  aX(t)  yields 
Eidxnfj:  *  0  and  Yar?dX*(  t*  )  >  *  dt». 

Although  the  stepping  problem  v Problem  2,  section  2'  does  not  involve 
an  .renown  parameter  an  analogue  or  this  rrofclem  cen  be  posed  in 
tem?  of  a  continuous.  time  Wiener  process  without  drift  originating  from 
a  given  point  »x,t t  <  0- 
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4.  Posterior  Distributions. 

Inasmuch  as  the  Sequential  Analysis  and  Rocket  Control  problems 
involve  unknown  parameters  which  may  be  estimated  by  incoming  data,  both 
have  a  statistical  component.  For  statistical  problems  in  a  Bayesian 
context,  the  posterior  distribution  of  the  unknown  parameter  is  crucial. 

In  the  discrete  case  suppose  that  p  has  prior  distribution 

2  2 
)  (normal  with  mean  pq  and  variance  aQ),  and  for  given  p, 

the  data  x^,x2,...,xn  are  independent  with  distribution  laws 
2  2 

£(x  )  =)^(u,cri),  and  o\  known.  Then  the  posterior  distribution  of 
p  given  the  data  is  (see  [18]). 


(4.1) 


•  •  •  >xn )  -  ^^n,Sn^ 


(4-2)  Y  =  (p  ct"“tx  oC2+’  •  *+x  c r"?)/(a-2+a"2+-..+o‘2)  ,  n  >  0 
n  'roo  11  r  n  ' '  o  1  n  *  - 


(4.3) 


-1  -2  -2,  -2 

3  =  a  4<t  +•  "+a  , 

n  o  1  n 


•a  >  0 


Here  Y  ,  tne  mean  of  the  posterior  distribution,  may  be  called  the 

(posterior)  Bayes  Estimate  of  p.  It  is  a  weighted  average  of  the 

-2 

individual  estimates  weighted  by  the  precisions  (cr^  )  where  the 
prior  distribution  is  treated  as  an  estimate  with  mean  pQ  and  precision 
o^.  Similarly  may  be  regarded  as  a  summary  of  the  previous 
information  and  as  an  estimate  of  p  with  precision  s"1. 


Since  Y  is  the  Bayes  estimate  of  p,  one  should  expect  that  for 
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n  >  m,  E(Y  i'f  )  *  Y  .  It  somewhat  more  surprising  to  find  by  routine 
*  n!  m  m 

but  tedious  calculations  that 

(4.4)  ffA'i  -Y  lY  )  =7?(0,s  -s  )  n  >  m  >  0  . 

'  ’  n  a  m  7  m  n  —  — 

5«  Continuous  Time  Problems. 

The  results  of  section  4  are  of  consequence  in  the  continuous  time 
analogue  of  the  sequential  analysis  problem  which  we  now  state. 

Problem  1* ■  (Sequential  Analysis,  Continuous  Time).  Find  an  optimal 
procedure  for  testing  H-, :  4  >  0  vs.  Hgi  4  <  0  when  the  cost  of  an  in¬ 
correct  decision  is  k  { 4  J ,  the  cost  of  sampling  is  c  per  ur.it  time, 
and  the  data  consists  of  a  Wiener  process  X(t)  unknown  drift  4  and 
known  variance  cf~  per  unit  time.  The  unknown  value  of  4  has  prior 
distribution  <£(4)  =^(4^>°'2)* 

OO 

As  a  consequence  of  the  results  of  section  4  we  have  for  Problem  1*, 
the  posterior  distribution  of  M  given  by 

(5.1)  c?C(m  [X(  t ' ),  0  <  t*  <  t)  =^(Y(s),s) 

where 

(5-2)  Y(s)  -  (400"T2-,-X(t)n’2]/(o'2+t o’2)  , 

(5’ 3)  s  =  +  to'2  , 

and  Y(s)  is  a  Wiener  process  in  the  -s  scale,  originating  at 

O 

(y0,sQ)  =  (M0,crf0),  i-e. 

(3-4)  E ( dY ( s  ) }  =  0  ,  Var(dY(s)  j  =  -ds  . 
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2 

Note  that  s  decreases  from  s  =  cr '  as  information  accumulates. 

o  o 

Since  the  X  process  can  be  recovered  from  the  Y  process  it 
sufficies  to  deal  with  the  latter  which  measures  the  current  estimate 
of  4  and  which  is  easier  to  analyze. 

In  the  sequential  problem,  when  the  statistician  stops  sampling  he 
must  decide  between  4  >  0  and  4  <  0.  The  posterior  expected 

cost  associated  with  deciding  in  favor  of  at  time  t  (when 

Y(s)  =  y)  is  then 

f  kj4jn(4;y,s)d4 

This  quantity  is  readily  computed  and  found  to  be  k ^fs  i|f+(y/Vs')  where 
V  (u)  =  cp(u)-u[l-$(uj].  Similarly  the  posterior  expected  cost  associated 
with  deciding  4<0  is  kV?iT(u)  where  \|f(u)  =  <P(u)+ii*(u).  It  is 
easy  to  see  that  if  sampling  is  stopped  at  Y(s)  =  y  the  decision  should 
be  made  on  the  basis  of  the  sign  of  y  and  the  expected  cost  of  deciding 
plus  the  cost  of  sampling  is  given  by 


(5-5) 


d(y,s)  =  ca2s-1  +  <Ky  /Vs)  -  ccr^/cf 


2  ,.J 

o 


where 


(5*6) 


f<p(u)-u(l-$(u))  u>0 


♦(u)  » 


(<P(  U  )+U$(  U  ) 


u  <  0 


Thus  the  continuous  time  sequential  analysis  problem  may  be  regarded 
simply  as  the  following  stoppLng  problem.  The  Wiener  process  Y(s)  is 
observed.  The  statistician  may  stop  at  any  value  of  e  >  0  and  pay 
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d(Y(s),s).  Find  the  stopping  procedure  which  minimizes  the  expected 
cost.  In  this  version  of  the  problem  using  the  posterior  Bayes  Estimate, 
the  statistical  aspects  involving  the  unknown  parameter  n  have  been 
abstracted. 

The  original  discrete  time  sequential  problem  can  also  be  described 

in  terms  of  this  stopping  problem  by  adding  the  proviso  that  allowable 

stopping  values  of  s  are  restricted  to  sQ, s^,Sg,. • where 
-2  -2  -1 

sn  =  (a  +na  )  .  At  this  point  it  should  be  reasonably  straightforward 

for  the  reader  to  see  that  the  discrete  version  can  be  treated  numerically 
by  backward  induction  in  terms  of  the  Y(s)  process  starting  from 
sn  <  c2/k2v2(0)  =  2«c2/k2. 

We  now  present  the  continuous  time  version  of  the  stopping  Problem 
2  of  Section  2. 


Problem  2*.  (stopping  Problem).  Let  Y(A)  be  a  Wiener  process  in  the 
-s  scale  with  E(dY(s)}  =  0,  Var(dY(s))»  -ds,  originating  from 
(yo,so).  Let 


07) 


if  y  <  0  and  s  *  0 
otherwise  (s  >  0) 


Find  the  stopping  time  to  minimize  E{d(Y(s),s)j. 

A  more  literal  translation  of  Problem  2  of  Section  2  would  yield 
a  stopping  cost  d*(y,s)  =  d(y,s)  +  sQ  Since  the  difference  la  constant 
it  does  not  affect  the  solution. 

While  the  rules  of  computing  posterior  distributions  extend  to  the 
rocket  control  problem,  that  problem  is  not  trivially  reduced  to  a 
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stopping  problem.  However,  we  shall  see  that  the  continuous  version 
of  the  infinite  fuel  problem  posed  in  Section  2  is  also  equivalent  to 
a  related  stopping  problem. 

6.  Continuous  Time  Stopping  Problems: Relevance  of  Stopping  Sets. 

A  general  class  of  stopping  problems  may  be  described  as  follows. 
Let  Y(s)  be  a  Wiener  process  in  the  -s  scale  originating  from 
(y0,sQ)  with  E{dY(s)}  =  0  and  Var{dY(s)}  =  -ds.  Let  d(y,s)  be  a 
specified  stopping  cost.  Select  a  stopping  procedure  S  to  minimize 
the  risk 

(6.1)  b(yo,sQ)  =  EWY(S)>S))  • 

A  stopping  procedure  S  is  a  measurable  rule  which  determines  the 
stopping  time  S  in  terms  of  the  ,:past  history  '  of  Y(s).  Technically, 
in  measure  theoretic  terms  this  may  be  translated  to  mean 

(6.2)  {S  >  s^)  1 2S(Y(s)s  sq  >  s  >  s1) 

where  the  right  hand  side  is  the  Borel  Field  generated  by  the  process 
from  sQ  to  Stopping  procedures  may  be  subjected  to  restrictions 

which  are  either  of  the  form  that  stopping  is  not  allowed  on  certain 
sets  of  points  (y,s)  or  that  stopping  is  automatic  on  other  sets. 

For  example  in  the  continuous  time  versions  of  both  problems  1  and  2  of 
Section  2,  stopping  must  take  place  if  s  ■  0.  In  a  trivial  sense  the 
discrete  time  version  of  the  sequential  analysis  problem  may  be  regarded 
as  a  continuous  time  problem  where  stopping  is  not  permitted  except  at 
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a  certain  set  of  values  of  s. 


While  the  discrete  time  problems  of  Section  2  are  theoretically 
trivial  insofar  as  the  solutions  can  be  computed  by  backward  induction 
this  is  not  the  case  for  tne  continuous  time  problems.  Even  discrete 
time  problems  with  an  infinite  sequence  of  possible  decision  times  lead 
to  difficulties.  The  problems  of  the  existence  and  characterization  of 
solutions  are  deep  and  much  remains  to  be  done  to  obtain  precise  rigorous 
results  for  the  continuous  time  problem.  We  shall  proceed  in  a  heuristic 
fashion  conveniently  ignoring  some  of  the  more  delicate  questions  which 
have  to  be  faced  ultimately. 

Let 

(6.3)  P(y0>s0)  =  lm’  b(yo'8o) 

among  all  procedures  S.  Iiote  that  p(yo,sQ)  <  d(yo,sQ).  Since  Y 
is  a  process  of  independent  increments,  it  follows  that  p(y,s)  also 
represents  the  best  thao  can  be  expected  once  Y(s)  •  y  is  reached, 
irrespective  of  how  it  was  reached.  Then,  a  characterization  of  an 
optimal  procedure  (under  regularity  conditions)  is  described  by 

(I)  SQ;  "Stop  as  soon  as  p(Y(s),s)  *  d(Y(s),s)" 

Since  the  optimal  procedure  SQ  is  characterized  by  the  continuation 
set 

(6.4)  »  {(y,s):  p(y,s)  <  d(y,s)) 
and  the  stopping  set 

(6.5)  *  l(y*s);  p(y,s)  •  d(y,s)) 


1? 


ve  shall  restrict  our  attention  to  procedures  which  can  be  represented 
by  a  continuation  set  C  or  its  complement  the  stopping  set  d  . 


It  is  interesting  to  note  that  the  characterization  (i)  does  not 
depend  on  the  initial  point  (yQ, sQ)  and  thus  it  yields  the  solution 
for  all  initial  points  simultaneously,  minimizing  b(y,s)  uniformly 
for  all  (y,s). 

Under  suitable  regularity  conditions  on  d(y,s),  the  solution  of 
the  continuous  time  stopping  problems  may  be  approximated  by  discrete 
time  versions  corresponding  to  a  finite  sequence  of  permitted  stopping 
times  (s^Sg,  •  •  •  >sn)*  Since  a  discrete  version  permits  less  choice, 
the  corresponding  optimal  risk  p*  is  larger  and  the  corresponding 
optimal  continuation  set  Cq  intersects  s  =  s^  on  a  smaller  set  than 
does  <£q.  As  more  elements  are  adjoined  to  the  set  of  permitted  stopping 
times,  p*  decreases  and  the  set  where  intersects  s  *  s.  Increases. 
In  this  way  p  and  <SQ  may  be  derived  as  limits  of  monotone  sequences. 

7.  Stopping  Problems.  Relevance  of  Heat  Equation. 

The  Wiener  process  is  intimately  related  to  the  heat  equation. 

Suppose,  for  example  that  b(y,s)  is  the  expected  cost  corresponding 
to  an  open  continuation  set  6  and  stopping  cost  d(y,s).  Then  we  shall 
demonstrate  that 

(7.1)  |  byy(y's)  *  Vy's)  (y»fl) €  ^ 

while 

(7>2)  b(y,s)  »  d(y,s)  (y,s)  e  J  . 

Suppose  (y,s)  e  C-  Then,  the  probability  of  stopping  between  s+6  and 


Ik 


s  is  0( 6  )  and  Y  changes  from  Y(s+6  )  to  Y(a).  Consequently 


b(y,s+8  )  =  E{b(Y(s),s) |Y(s+6  )  =  y}  +  0(8  ) 

(7-5) 

=  E£b(y+vVT,s)}  +  0(b  ) 

where  we  use  w  as  a  generic  ??(0,l)  random  variable 

b(y,s+-6  )  =  E(b(y,s)+wVT  by(y,s)+  |  w2(  8  )b  (y,s)+*  •  • }  +  0(  8  ) 

■  b(y,s)  +  |  byy(y,s)(  6  )  +  0(  8  ) 
and 

b  *  i  b 
s  2  yy 

Doob  has  elaborated  on  the  relationship  between  the  Viener  process 
and  the  heat  equation  Indicating  that  it  represents  the  natural  way  in 
which  to  study  the  heat  equation.  To  digress  briefly  and  omitting 
regularity  conditions,  a  subparabolic  function  u  on  an  open  set  D 
is  such  that  for  a  Wiener  process  Y(s)  originating  at  (y,s) 

(7-M  u(y,s)  <  E{u(Y(S),S)} 


where  S  is  the  time  when  Y(s)  first  hits  the  boundary  of  an  open  set 
6  c  D  of  which  (y,8)  is  an  interior  point.  A  parabolic  function  is 
one  for  which  the  inequality  is  replaced  by  equality.  If  the  second 
derivatives  are  continuous  then 


(7.5) 
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for  subparabolic  functions  with  equality  for  parabolic  functions.  Thus 
solutions  of  the  heat  equation  are  identified  with  parabolic  functions. 

To  solve  the  Dirichlet  Problem  (solution  of  j  u  *  ufi  in  C  subject 
to  u  ■  f  on  the  boundary  of  &)  Doob  takes  u(y,s)  ■  E{f(Y(S),S)}. 

The  concept  of  subparabolic  functions  provides  another  characterization 
of  the  optimal  risk.  Ve  observe  that 

(II)  p(y,s)  is  the  maximal  subparabolic  function  which  is  less 
than  or  equal  to  d(y,s). 

To  see  that  p  is  subparabolic,  take  an  arbitrary  set  &  of  which 
(y,s)  is  an  interior  point.  Then  E{p(Y(S),S))  represents  the  risk 
associated  with  the  suboptlmsl  procedure  which  does  not  stop  as  long  as 
(Y(s  ),s )  €  £  but  which  proceeds  optimally  thereafter.  Thus 

(7-6)  p(y,s)  <  E(p(Y(s),S)) 

and  p  is  subparabolic.  Let  p1  be  any  subparabolic  function  such  that 
p^  <  d.  Using  the  optimal  continuation  set  tor  & ,  we  have, 

for  (y,s)  t  Ctf 

p(y,s)  -  E(d(Y(S0),S0))  >  E(0l(Y(S0),S0))  >  o^a)  . 

If  (y#a)  i  p(y#a)  «  d(y,s)  >  p^s)  which  completes  the  proof. 

Given  a  function  u(y,s)  and  a  continuation  set  £,  can  ve 
determine  whether  (u,£)a(p,^)  l.e.  whether  (u,4&)  solve  the 
optimisation  problem  associated  with  the  stopping  problem.  A  sufficient 
condition  is  the  following. 
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(III)  If  u  <  d  Is  a  subparabolic  function  which  Is  parabolic  on 
the  open  continuation  set  £  and  u  *  d  elsewhere,  then 
(u,£)  *  (p >6q)>  the  solution  of  the  optimization  problem- 

To  show  this,  note  that  since  u  <  d  is  subparabolic,  u  <  p. 

But  u  is  the  risk  corresponding  to  the  continuation  set  C •  Hence 
u  >  p. 

8.  stopping  Problems  -  Free  Boundary  Problem. 

Associated  with  a  procedure  described  by  a  continuation  set  C  we 
have  a  risk  function  b(y,s)  which  satisfies  the  heat  equation  in  £ 
subject  to  the  boundary  condition  b  «  d-  The  solution  of  the  optimal 
stopping  problem  minimizes  b  everywhere.  Bow  we  present  the  character¬ 
ization 

(IV)  p  (y,s)  ■  d  (y,s)  on  the  boundary  of  C  . 

JT  Jr  w 

While  the  Dlrlchlet  problem  of  finding  b  which  satisfies  the  heat 
equation  in  a  given  £  subject  to  b  •  d  on  the  boundary  is  referred 
to  as  a  boundary  value  problem,  that  of  finding  b  and  £  so  that 
by  •  dy  on  the  boundary  also,  is  referred  to  as  a  Stefan  or  free  boundary 
problem  (f  .b.p. ).  Property  (IV)  states  that  the  solution  of  the  opti¬ 
mization  problem  is  the  solution  of  the  (f.b p> ). 

To  demonstrate  (IV),  let  u»  assume  that  (yQ,«0)  is  a  point  on  a 
portion  of  the  boundary  above  which  are  stopping  points  and  below  which 
are  continuation  points  and  that  dy  exists  at  (y0#80)*  Then  since 
o(y,*0)  *  *(y»»0)  for  y  >  y0  the  right  hand  derivative  o*(j0,»9)  * 

4y(y0,*0>-  *>r  y  <  yc»  <>(y0,s0)  <  ^y0»*0)  l*oc* 


0y(yo'8o^  -  Vyo»Bo)-  Now  we  note  that 

(8.1)  p(y0,8o+5  )  <  v{p{y0+*fP,  s0)) 


since  the  right  hand  side  corresponds  to  the  risk  of  the  subopt Inal 
procedure  where  one  insists  on  sampling  from  &0+6  to  sQ  and  procedlng 
optimally  thereafter.  But 


p(y  ♦wV’b'jO  *  p(yrt»3ft) +  P^(yft#sJ  +  o( VP)  v  >  0 


(8.2) 


p(ye#s0)  +  v/P  p“(yQ,so)  +  o(Vb^)  w  >  o 


E(p(yo*vV&\8o))  »  p(y0#8o)  (d^  J  wq>(w)dw+o"  J  w<p(  v  )dw  }+o(  VV) 

*  p(y0»*0)  (^y-Py)  +  °«V^) 


Thus 

0(yo»«o+6  '  "  °(y0*#0^  -  V®'  (dy-Py)  ♦  (V^) 


Assuming  that  the  difference  quotient  ip(yo,so*6  )-c(y0,s0))(  b  )-1  is 
bounded  It  follows  that  dy-Oy  >  0  which  combined  with  the  preceding 
results  gives  oy  •  dy  on  the  boundary  which  establishes  (IV). 

Returning  to  tt*  free  boundary  problem  (f.b.p. )  the  following 
question  arises.  Is  a  solution  of  tha  f.b.p.  necessarily  a  solution 
of  tha  optimisation  problem?  The  answer  is  get  provided  certain  additional 
conditions  are  satisfied.  That  additional  conditions  are  required  la 
clear  from  the  following  considerations.  Suppose  that  (u,5)  is  a 
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solution  of  both  the  f.b.p.  (*  u„r  *  u  on  G,  u  ®  d  and  u  »  d 

d  yy  s  y  y 

on  the  boundary)  and  the  optimization  problem  (u  =  p,  Cb&J.  if  the 
problem  is  modified  by  sharply  decreasing  d  below  u  on  part  of 
then  {'d,C)  remains  a  solution  of  the  free  boundary  problem  but  the 
solution  of  the  optimization  problem  changes.  If  d  is  sharply  decreased 
on  a  small  part  of  the  stopping  set  near  the  boundary  of  C,  the  optimal 
continuation  region  should  be  enlarged  but  here  again  (u^#)  remains 
a  solution  of  the  free  boundary  problem. 

These  examples  lead  to  sufficient  conditions  which  are  related  to 
III.  One  of  these  (V)  may  oe  paraphrased  to  state  that  if  one  can't 
trivially  improve  on  u  (as  was  possible  in  the  above  counterexamples) 
then  u  *  p.  Let 

(8.3)  h(y,s;s')  »  ElhCy+wVs*®*#®' ))  s  >  s' 


If  u  la  the  risk  corresponding  to  the  continuation  eat  £  and 
(i)  u(y,s)  <  d(y,s)  and 
(ii)  u(y,s;s' )  >  d(y,s)  for  (y,s)  c  J 


then  (u,$)  solves  ti* 


prgbjep. 


While  (V)  does  not  invoke  the  f.b.p.  condition,  that  condition  can 
be  used  to  prove  condition  (ii)  of(»*).  This  yields 

(VI)  If  (u ,C)  is  a  solution  of  the  f.b  p.  where  C  is  a  continuation 
set  and  u  and  d  have  bounded  derivatives  up  to  third  order 


on 


(if) 

then 


(u,£)  is  a  solution  of  the  optimization  problem. 


In  some  applications  (VJ.)  is  not  enough  because  some  of  the  conditions 
break  down  as  s  approaches  its  lower  limit  sQ  (possibly  -«*}.  In 
that  case  ic  suffices  to  invoke  some  supplementary  condition  which 
implies 

(8.4)  sup  |u(y,s)-p(y,s)|  ->C 

s— »s 

o 


9.  Solutions,  Bounds  and  Expansions  for  Stopping  Problems. 

In  the  continuous  version  of  Problem  2,  ve  have  a  stopping  problem 
which  may  be  represented  by 

d(y,s)  =  -s  for  s  >  0  and  for  y  >  0,  s  =■  0 

(?•!)  2 

d(y,s)  =  -y  -s  for  y  <  0,  s  =  0  . 

This  problem -has  the  trivial  solution  where  =  {.(y,  s):  y  <  0,  s  >  0) 
and 

p(y, s)  =  -s  for  y  >  0,  s  >  0 

(9-2)  2 

p(y, s)  *  -y  -s  for  y  <  0,  s  >  0  . 


The  pair  (p,(?  )  is  a  solution  of  the  (f.b.p. )  since  p  «  d  and  p  »  d 

w  if  w 

for  y  =  0.  Property  (V),  Section  8  applies  as  does  a  modified  version 
of  Property  (VI),  Section  8  (a  modification  is  required  because  p  and 
d  are  not  bounded). 

Generally^ stopping  problems  are  not  so  easily  solved.  It  is  useful 
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to  derive  bounds  on  p  and  £Q-  To  illustrate  let  us  introduce  a  new 
stopping  problem^the  importance  of  which  will  be  discussed  later. 

Problem  4.  A  stopping  problem  involving  Y(s),  EdY(s)  =  0,  Var  dY(s)  *  -ds, 


i  y  ion 

s  —  0, 

y  >  0 

(9  3) 

d(y,s)  =  < 

s'1  for 

s  >  0, 

y  >  0 

0  for 

s  >  0, 

'C 

u 

o 

and  stopping  is 

enforced  when 

Y(s )  =  0 

or  s 

=  0. 

Note  that  if  s  is  large,  the  chances  of  obtaining  Y(s)  »  0  (and 
zero  cost)  before  s  =  0  is  large  and  so  one  is  encouraged  to  continue 
unless  Y  is  large.  If  s  is  small,  the  cost  of  stopping,  (s_1)  is 
large  compared  to  the  cost  of  waiting  till  s  »  0  (approximately  Y) 
unless  Y  is  large  and  one  is  encouraged  to  continue  unless  Y  is 
large.  Thus  one  expects  to  have  a  boundary  which  is  high  for  s 
large  and  s  small. 

Let  u(y, s)  be  an  arbitrary  solution  of  the  heat  equation.  Let 
$  be  the  set  on  which  u(y, s)  =  d(y,s).  If  $  is  the  boundary  of  a 
continuation  set^tne  risk  for  the  procedure  defined  by  the  continuation 
set  &  is  b(y,s)  *  u(y,s)  on  £  and  b(y,s)  =  d(y,s)  on  d .  But 
then  p(y,s)  <  b(y,s).  Thus  if  (y0>E0)  is  a  point  of  £  where  u  <  d, 
then  p(y  ,s  )  <  d(y  ,s  )  and  (y  ,s  )  is  a  continuation  point  for 

0  0  .  0  x0  0  0 

the  optimal  procedure. 

For  Problem  take  u^(y,s )  =  y  which  is  a  solution  of  the  heat 
equation.  £  -  {(y,s);  0  <  y  <  s"1,  s  >  0).  Since  u^(y,s)  <  s"1  at 
every  point  of  £,  £  c  <*T  and  the 
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boundary  y^s)  «  s"1  is  a  lower  bound  for  the  optimal  boundary  y(s), 
i.e. 

(9.4)  y(s)  >  y1(s)  =  s’1  . 

We  now  describe  a  method  of  finding  upper  bounds  for  the  optimal 
boundary.  In  more  general  context  these  represent  outer  bounds  for  (%o . 
Let  u(y,s)  be  a  solution  of  the  heat  equation.  Let  &  be  the  set 
on  which  u  (y,s)  =  d  (y,s).  Let  C  be  the  continuation  set  for  which 

y  y 

73  is  the  boundary.  If  u  /  d  on  7S  let  h(s)  =  u(y,s)  -  d(y,s)  along 
the  boundary  and  let  d*(y, s)  =  d(y, s)  +  h(s).  Then  (u/S)  is 
a  solution  of  the  f.b.p.  for  d*(y,s).  Suppose  that  (u,£)  is  also  a 
solution  of  the  optimality  problem  for  d*  and  h(s)  <  0  for  s  < 
and  h(s2)  =  0.  Then  the  modified  problem  is  a  more  "advantageous" 
problem  than  the  original  for  s  =  s2  and 

p(y,s2)  >  u(y,s2)  . 

If  (y2,s2)  is  a  stopping  point  for  the  modified  problem 

(9*5)  p(y2,s2)  £  u(y2»s2)  =  d*(y2>s2)  a  d(y2>s2) 

and  (y2, s2)  is  a  stopping  point  for  the  original  problem. 

In  review  we  obtain  outer  bounds  on  the  continuation  set  by  finding 

arbitrary  solutions  of  the  heat  equation  which  are  suitable  (i.e. 

u-d  <  0  along  the  boundary  where  u  ■  d  ).  In  principle  this  method 

y  y 

is  as  elementary  as  the  other  method  but  in  application  it  is  usually 
more  delicate. 

To  illustrate 
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sinh  ay 


Uo(y>s)  =  y  -  Be4- 


O  a  8 


is  a  solution  of  the  heat  equation  for  which  u2y  =  dy  *  0  when  y  «  y2(s) 
which  is  determined  by 

1  2 
2  a  s 

1  -  Bae  cosh  ay  =  0 


For  s  =  0,  u2  -  d  <  0.  Along  the  boundary  y  «  y^s)^  u2-<i  ta^e  on 
negative  values  for  small  positive  s.  The  smallest  positive  value  s2 
of  s  (if  any)  where  u0-d  vanishes  is  described  by 


2  8  s 

y  -  Be  sinh  ay  <=  s" 


Any  pair  of  parameters  (a,B)  which  yields  such  a  pair  (y2,s2)  may  be 
used  and  the  corresponding  point  (y2>62^  is  a  P0^  of  ^ Q‘  To  find 
the  best  such  point  for  a  given  s?,  we  select  a  and  B  to  minimise 

d(u2-d) 

y2-  If  for  fixed  (a,B),  -  gg-  —  >  0  at  (y2#s2)  it  would  be  possible 

to  adjust  a  and  B  to  decrease  y2-  Thus  we  impose  the  third  condition 

d(u2-d) 


Ba^  £  8  8 

e  cosh  ay  ■  s" 


Together  the  three  conditions  lead  to  the  representation  for  an  outer 
bound  y2(s)  for  the  boundary,  i.e.  y2(s)  >  y(s)  where  y2(s) 
satisfies 


(9.6) 


(K1)172  ■ 
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-1  /  V  -1  1  2  -V 

It  is  of  interest  that  s  <  y2(s)  <  s  +  ^  8  which  indicates  that  y 

is  veil  approximated  by  s”1  for  small  s.  For  large  s  better  approxi¬ 
mations  are  obtained  by  using  similar  arguments  vith  solutions  of  the 
heat  equation  of  the  form 

u  ■  Acp(  ■  wJL^  p\}\  ,  h  >  0 

ys+n  (Vs+h/'  ” 

vhich  yield  the  lower  bound 

(9*7 )  y^(s)  ®Vs'<y(s) 

and  the  upper  bound 

(9-8)  yk(s)  -  (s+h^^yCs) 

where  h  >  e1^  satisfies  s  ■  e1/^2h(h^/^2-e1^2)’1.  Together  these  show 
that  y(s)  «*  s1//2{l+0(s"1))  for  large  s. 

Another  but  related  approach  to  approximating  the  optimal  boundary 
consists  of  finding  asymptotic  expansions  for  the  risk  and  boundary 
near  distinguished  points  of  s;  these  distinguished  points  are  typically 
the  end  points  of  the  range  of  interest.  For  example  s  ■  0  and  » 
are  important  in  examples  1,  2,  and  4.  The  proofs  that  the  formal 
expansions  derived  by  methods  to  be  briefly  described  do  indeed  represent 
approximations  to  the  desired  solution  depend  on  arguments  of  the  type 
described  above. 

One  important  class  of  solutions  of  the  heat  equation  used  in 
generating  expansions  is  that  generated  by  "sources  of  heat"  along  a 
vertical  (s  •  constant)  line.  Thus 
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(9.9)  uQ(y,s)  -  s“1/2cp(a)  ,  a  =  y//s 

represents  a  point  source  of  heat  at  (y, s)  *  (0,0)  and  yields  a 
solution  of  the  heat  equation  for  s  >  0.  Similarly,  functions  of  the 
form 

u(y,s)  =  f h(y'  )dy*  - 

ys  ys 


satisfy  the  heat  equation- 

Such  techniques  lead  to  asymptotic  expansions  for  the  optimal 
solution  of  the  sequential  analysis  problem  (Problem  1*  of  Section  5)  of 
the  form 


2r(s)  *  y(s)s-1/2  ~  {log  a2s^  -  log  8x  -  6(log  a2s^)  1  +  •••)  as 


\  \  -l/2  1  J/2  <•-  a  s  7 

a(s)*y(s)s  '  ~  ^  as  (1-  -gg-  + 


k  6 

as  •  ••• 


as  8  -+  0 


where  a  »  k/c  . 


10.  Control  Problem. 

Let  us  return  to  the  rocket  control  problem  (Problem  3,  Section  2). 
For  reasons  to  be  discussed  later  an  important  case  can  be  described  in 
it6  continuous  time  version  as  follows. 

Problem  3*.  One  observes  Y(s),  a  Wiener  process  in  the  -s  scale  orig¬ 
inating  at  (y0,s0)  with  E(dY(s)j  ■  0  and  Var(dY(s)}  »  -ds.  As  s 
decreases  to  0,  Y(s)  may  be  adjusted  instantaneously  at  any  s  >  0  by 
an  amount  A  at  a  cost  of  |&jd(s)  where  d(s)  ■  s"^.  In  addition  to 
the  accumulated  cost  due  to  the  adjustments  of  Y(s),  there  is  a  coat  of 
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\  Y^(0).  Find  the  rule  for  adjusting  Y(s)  which  minimizes  the 
expected  cost. 

The  discrete  time  version  of  this  problem  corresponds  to  the  specifi¬ 
cation  of  a  finite  set  of  s . ,  s  >  s,  >  s0  >  •  •  •  >  s  *  0  at  which 
changes  (corresponding  to  the  use  of  fuel)  are  permitted.  Let  p*(y,s^) 
represent  the  expected  additional  cost  associated  with  the  optimal 
procedure  for  the  discrete  time  version  given  Y^)  *  y^.  Since  it  is 
possible  to  change  y  instantaneously  at  a  cost  of  d(si)  per  unit  y 

P*(y>st)  <  0*(y,,si)  +  d(si)|y'~y[  . 


from  which  it  follows  that 


(10.1) 


dp*(y,s  ) 

I — -  d(si} 


With  a  slight  variation  of  this  approach  let  p*(y,si)  represent  the 
optimal  risk  at  s  *  s,  subject  to  the  restriction  that  fuel  is  not 
used  at  s  =  s^  Here 

(10.2)  p*(y^s1)  *  inf {p*(y%si)  +  d(s±) |y'-y | )  . 

y 

Regarded  as  a  function  of  y,  p*  has  straight  line  sections  with  slope 
drfdy  =  -i:d(  si ),  where  it  pays  to  use  fuel.  Elsewhere,  it  does  not  pay 
to  use  fuel  and  |d<$ydy|  <  d(si).  Thus  the  optimal  policy  is  described 
by  an  action  set  and  a  continuation  or  no-actlon  set.  If  (y^)  is  on 
the  action  set  one  moves  to  a  point  (y^s^ )  on  the  boundary  of  the 
continuation  set  by  applying  fuel.  Otherwise  no  fuel  is  used  at  this 
stage  It  Is  possible  to  show  that  p*  is  symmetric  and  decreasing  In 
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|yj.  Hence  the  optimal  continuation  set  is  described  by  -y*(s)  <  y  <  y*(s). 

Let  us  proceed  to  the  continuous  time  version  of  the  problem  for 
which  the  above  characterization  still  applies.  The  boundary  of  the 
optimal  no-action  set  is  given  by  ±y(s).  We  restrict  our  attention  to 
procedures  which  may  be  described  by  a  symmetric  no-action  set  with 
boundary  ±y1(s)  and  let  b(y,s )  be  the  additional  expected  cost  given 
Y(s)  =  y  associated  with  such  a  procedure.  We  shall  show  that  b(y,s) 
satisfies  the  following  conditions 


(10.3) 

5  by y(y,s)  -  bg(y,s) 

on  the  no-action  set 

(10.4) 

IWy,E)  ■  Vy,E) 

on  the  no-action  set 

(10.5) 

by(y,s)  *  d(s)  =  s"1 

on  that  part  of  the  boundary 

and  action  set  for  which 
y  >  0,  s  >  0 


(10.6)  b  (0,8)  «  0  for  s  >  0 

(10.7)  by(y,0)  ■  y  for  y  >  0 

However,  in  Problem  4  of  Section  9 >  we  described  a  stopping  problem 

whose  solution  uniformly  minimizes  b  subject  to  the  restrictions 

(10.4-10.?).  Consequently  the  optimal  expected  cost  for  our  control 
problem  can  be  obtained  by  integrating  the  solution  of  the  stopping 
problem.  Problem  4.  The  optimal  no-action  set  is  the  optimal  continuation 
set  of  the  stopping  problem. 

In  this  particular  case  we  have  been  fortunate  and  profited  from 
the  symmetry.  Otherwise  we  would  have,  for  arbitrary  procedures  described 
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by  no-action  sets,  that  b(y,s)  satisfies  (10.3),  (10.4),  (10.7^  and  (10.3) 
replaced  by 

(10.3*)  b  (y,s)  -  ±d(s)  on  the  boundary  and  action  set 

The  optimality  condition  required  to  determine  the  free  boundary  vould 
be 


(10.8) 


Pyyty,.)  -  0 


on  the  boundary 


This  corresponds  to  (p  )  ■  d  .  and  thus  the  derivative  of  the  expected 

y  y  r 

cost  of  the  control  problem  satisfies  the  same  (f.b.p. )  as  do  the  optimal 


In  these  discussions  several  claims  were  made  which  require  some 
support.  First  let  us  deal  with  the  formulation  of  Problem  3*.  Suppose 
that  incoming  data  used  to  estimate  the  miss  distance  have  variance 
Inversely  proportional  to  the  distance  to  target.  Then  the  reasoning 
of  Section  4  applied  to  a  continuous  time  version  indicates  that  at  any 
given  tlme^the  posterior  distribution  of  the  miss  distance  is  ?f(Y(s),s) 
where  Y(s);  the  current  estimate  of  the  miss  distance^ is  a  Wiener 
process  in  the  -s  scale  and  s,  measuring  the  total  cumulated  precision 
is  given  by 

i*1 .  ♦  r* 

**  A 


where  tQ  is  the  total  required  tine  of  flight.  For  simplicity  let  us 

-2  -1 

assume  that  the  two  quantities,  ?0  and  or  ,  both  of  which  are 
ordinarily  small,  cancel  giving 


a*1  ■  ^(Vt)-1  . 

Let  us  also  assume  that  the  amount  of  fuel  required  to  change  the  mlaa 
distance  by  an  amount  A  la  proportional  to  the  distance  to  target. 

Then 

e(t)  »  ck*(t0-t)  ■  d"1^) 

and  hence 

(10.9)  d(s)  -  as"1  . 

Since  s  •  0  corresponds  to  Infinite  precision  and  time  of  arrival, 
the  cost  of  missing  is  kY^(O).  How  let  s*  »  h2s  and  y*(s*)  ■  hY(s). 
Then  Y*  is  a  Wiener  process  in  the  -s*  scale  and,  in  terms  of  s* 
and  Y* 

d(s)  •  d*(s*)  ■  ahV*1 
kY2(0)  •  kh‘2{Y*(e*)l8  . 

•2  X  P 

Selecting  kh  »  j  ah  gives  us  a  starred  problem  where  the  costs  are 
proportional  to  those  of  Problem  5*. 

The  fact  that  b  satisfies  the  heat  equation  in  the  interior  of 
the  no-action  set  follows  by  the  typical  argument.  This  in  turn  implies 
that  by  satisfies  the  heat  equation,  aquations  (10.6)  and  (10.7) 
follow  from  the  ay— try  and  terminal  cost. 

To  Justify  (10.$)  one  must  consider  behavior  near  the  boundary. 
Bare,  one  puxtling  aspect  of  the  continuous  time  version  of  our  policy 
which  was  deliberately  evaded  oust  now  be  faced.  Suppose  that  the 
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boundary  y^( 8 )  is  a  well  behaved  function  of  s.  Then  if  (y,s)  is 
on  the  action  set,  the  policy  calls  for  bringing  Y(s)  to  y^s).  later 
the  unadjusted  Y(s)  is  a  rather  complicated  function  and  is  bound  to 
leave  the  no  action  region  "iraediately"  after  it  Is  brought  to  tbe 
boundary.  How  does  one  compute  the  amount  of  fuel  that  is  used  in  tbe 
— ny  infinitesimal  departures  and  returns?  Fortunately  this  can  be 
conveniently  expressed  in  terms  of 

(10.10)  M(s)  *  nax{0,  sup  [Y,(s* )-y,(s' }} ) 

s  >  s'  >  s 
o  —  — 

where  Y^s')  is  the  original  (unadjusted)  Wiener  process  and  in  the 
neighborhood  of  an  upper  boundary  point  (yp(#0)>*0)  °f  the  no-act  tor- 
set,  the  adjusted  process  behaves  like 

(10.11)  Y(a)  •  Yx(s)  -  M(a} 

If  Y(s)  -  y1(sQ)  and  y^s)  has  finite  slope  then  for  snail  6, 

X(M(a  -»))  -#61/2M*)  where  H*  »  sup  V(t)  and  V(t)  is  a 
°  °  <  t  <  1 

standard  Wiener  process.  Moreover  the  uee  of  a  reflection  principle 
yields  P(K»  >  a)  •  2P(W(1)  >  a)  for  a  >  Q  end  hence 

(10.12)  4C(M(so-»))  •4D(e1^|v})  as  6-»0 
where  £(w)  *  >1(0,1). 

Mow,  to  dmonatrate  (10.5)  at  an  upper  boundary  point  (y0,»0)  it 
ia  easy  to  see  that  bj(>0#*0)  ■  d(§0).  W«  *haU  new  show  that 
h*(y^,s  )  «  d(sj  assuming  that  b(y#»)  -  b(y,e-&)  •  0(&)»  Between 
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time  sQ  and  sq-6,  the  process  originating  from  (y0,sQ)  is  adjusted 

by  a  total  amount  M(s  -ft)  and  at  time  s_-ft  is  at  Y(s  -ft)  «  Y,  (s  -ft)  - 

o  O  0X0 

M(s0-ft). 

^>(y0^0)  *  E(d(so)M(so-ft)  +  b(Y(s0-ft),sc-ft))  +  0(ft) 

*  £(b(y0,so)  +  'b"(Y1(«o-© )-yo-M(so-ft )]  +  d(s0)M(ao-ft)  +  0(ft)) 

-  *(yo>80)  -  lb“-d(s0)]E(M(so-ft))  +  o^2) 

Since  E{M(so-fi)}  is  approximately  (28/n)1^,  the  desired  result 
follows. 

Finally,  we  demonstrate  that  optimality  implies  that  p  ,  »  0  on 

w  v 

the  boundary.  First,  since  b  is  constant  above  (y  ,s  ), 

y  oo 

p*  (y  ,s  )  -  0.  Second,  since  jp  |  <  d(s  )  in  the  no-action  set, 

«/  J  V?  w  J  “ 

o’(yo,6Q)  >  0.  The  subopt iml  procedure  in  which  no  action  is  taken  from 
s0  +  ft  to  s0,  and  an  optimal  policy  is  followed  thereafter  has 
risk  b  where 
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p(y0,s0+6)<b  =  E{p(y0+vVr,  «0)J 

p(y0>s0+<5  )  <  p(yo#so)+Ewpy  5  +  ^UPyy5  )J  v  cplwJdw^PyyB)  J  v  <p(v)dw]+o(  6 

p  6  =  i  p"  ,6  <  i(  6  )[p"  ]  +  o(  8  ) 

Ms  2  Hyy  -  4'  lMyyJ  v 

p"  <  0 

yy  - 

which  implies  p  =  0. 

yy 

Thus-  we  see  that  and  &  ,  the  partial  derivative  of  the  optimal 
risk  and  the  optimal  no-action  set  for  the  control  problem^ correspond 
to  a  solution  of  the  free  boundary  problem  determined  by  d(s).  This  is 
the  case  even  without  the  benefit  of  the  symmetry  which  we  used.  Further¬ 
more  in  the  more  general  case  where  the  cost  of  fuel  per  unit  change 
of  y  is  represented  by  d(y,s),  the  boundary  conditions  for  p 

y 

would  be  p  =  d  and  p  «  d  . 

y  yy  y 

11.  Summary  and  Remarks. 

Approximating  discrete  time  problems  by  continuous  time  problems 
invoking  the  Wiener  process  makes  it  possible  to  apply  the  analytic  methods 
of  partial  differential  equations  to  problems  of  sequential  analysis, 
which  are  basically  special  examples  of  stopping  problems,  and  to  certain 
stochastic  control  problems.  It  was  seen  that  the  solution  of  stopping 
problems  reduce  to  the  solution  of  free  boundary  problems  involving  the 
heat  equation.  Almost  arbitrary  solutions  of  the  heat  equation  could 
be  used  to  provide  bounds  on  the  solution  of  stopping  problems.  Asymptotic 
expansions  for  the  solution  are  obtainable  by  use  of  relatively  simple 
classes  of  solutions  of  the  heat  equation. 
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It  is  difficult  to  state  and  prove  rigorously  "nice"  theorems  of 
the  kind  "the  solution  of  the  optimization  problem  is  a  solution  of  the 
f.b.p.".  It  would  be  desirable  to  have  such  theorems  which  invoke  only 
conditions  on  the  elements  in  the  statement  of  the  problem  such  as  the 
function  d(y,  s).  Most  proofs  seem  to  involve  conditions  on  the  nature 
of  the  unknown  solutions.  This  problem  seems  hard  to  avoid  because  the 
solutions  of  certain  problems  of  interest  have  singular  points  where 
the  f.b.p.  condition  breaks  down.  On  the  other  hand  the  sufficiency 
theorems  which  permit  one  to  recognize  when  a  given  candidate  is  a  solution 
of  the  optimisation  problem,  are  much  more  amenable  to  useful  statements 
which  can  be  reasonably  applied.  Fortunately  these  sufficiency  results 
are  the  more  important  ones  because  they  are  the  ones  invoked  in  applying 
the  methods  of  bounding  solutions  of  stated  problems  in  terms  of  arbitrary 
solutions  of  the  heat  equation  and  solutions  of  related  optimum  problems. 

The  rocket  control  problem  has  a  continuous  time  formulation  similar 
in  certain  aspects  to  that  of  the  stopping  problem  and  here  the  derivative 
of  the  optimal  expected  cost  also  is  a  solution  of  the  f.b.p. 

A  rocket  control  problem  where  fuel  is  free  but  only  a  finite  amount 
is  available,  is  more  difficult  to  treat.  A  role  analagous  to  that  of 
p  in  the  infinite  fuel  case  is  played  by  V  ■  p  +  e(s)p  where  u  is 

v  ^  J 

the  amount  of  fuel  available  and  e(s)  is  the  change  in  y  obtainable 
from  a  unit  of  fuel.  Since  V  measures  the  rate  of  gain  derived  from 
using  fuel,  V  <  0  on  the  no- action  set  and  V  ■  0  on  the  action  set. 
Bounds  and  expansions  have  been  derived  for  the  solution  of  this  problem 
subject  to  the  following  conjecture.  Let  a  taxed  version  of  the  control 
problem  be  such  that  at  s  *  sQ  fuel  is  free,  but  later,  (s  <  so),  fuel 
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4 


must  be  paid  for  and  fuel  remaining  at  s  =  0  must  be  taxed.  For  a 

situation  where  the  original  untaxed  problem  calls  for  the  use  of  fuel 

at  s  =  s  .  the  taxed  version  also  calls  for  the  use  of  fuel  then, 
o 

An  approximation  has  been  derived  which  relates  the  continuous  time 
solution  to  discrete  time  solutions  of  stopping  problems  with  finely- 
spaced  intervals  between  the  permitted  stopping  times. 

Work  has  been  carried  out  on  continuous  time  stopping  problems  which 
do  not  involve  the  heat  equation.  These  include  that  of  Mikhalevich 
[19]  where  the  Poisson  process  leads  to  a  difference  differential  equation 
and  a  pair  of  diffusion  equations  of  Shiryaev  [22]  where  the  number  of 
possible  values  of  n  are  finite  and  past  history  is  summarized  by  a 
few  posterior  probabilities  rather  than  (Y(s),s).  Bather  [4,5]  ha! 
considered  certain  problems  which  involve  ordinary  differential  equations 

i 

because  of  the  stationarity  of  these  problems.  * 

The  history  of  these  ideas  and  methods  is  long  and  complicated  and 
the  following  is  a  bare  outline  of  related  references. 

Backward  induction  -  Dynamic  programming  -  [2],  [8]. 

Stopping  problems  (discrete  time)  -  [14],  [17 3 ^  [24],  [25]. 

Stopping  problems  (continuous  time)  -  [1],  [3],  [4],  [5],  [9],  [1Q]> 

[11],  [12],  [13],  [16],  [191, 

[20],  [22] ' 

Rocket  control  -  [6],  [7],  [21],  [26]. 

Heat  equation  -  [15 ]  - 
Expository  article  -  [23]. 
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Closure  under  the  Formation  of  Coherent  Structures 

Perhaps  the  most  common  structures  for  reliability  consideration  are  the 

series  structures 


with  Boolean  structure  function 

*(*)  *  Xj  ...  XR 

where  the  indicator  variable 

1  if  the  ith  component,  c^  ,  works 
0  otherwise 

and  the  parallel  structures 


with  Boolean  structure  function 

*<x)  -  ^  V  x2  V  ...  V  xn 


2 


where  x  V  y  •  1  if  and  only  if  x  •  1  or  y  ■  1  or  x  -  1  ,  y  •  1  .  Also 
Important  are  the  k  out  of  n  structures  which  work  if  any  k  or  Bore  of  the 
n  components  work.  The  2  out  of  3  structure,  for  exanple,  can  also  be  represented 
in  teras  of  series  end  parallel  etructures  if  we  allow  replication;  i.e., 

C1  c2 

i- .  »  . 

c2  c3 

- - +-  - - 

C1  c3 

•  - 

The  Boolean  structure  function  for  this  special  structure  is 

♦<*>  "  *!*2  V  *2*3  V  *1*3  ' 

More  generally,  we  have  the  following: 

DKTIWITIOfl  A  ooharant  atruotura  is  a  couple  (C,  4)  consisting  of: 

1)  a  set  of  cosponants  C  -  jc^  c2>  ....  cq|  ; 

2)  a  Boolean  function  4  defined  on  vectors  x  •  ( x^ ,  . . . ,  xq  )  of 
binary  indicator  variables  and  satisfying 

(i)  4(0)  •  0  and  4(1)  «  1  ; 

(ii)  x  <  x  (coordinatewise)  iwpliea  4(x)  <  4Ct) 

Let  X^  be  a  binary  random  variable  corresponding  to  the  1th  component 

PlXt  -  1]  -  P4 


and  let 
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PlXi  "  °J  *  1  -  Pt  *  <4  • 

Let  X-  (X^,  Xj,  ....  Xn)  ,  £  ■  (Py  P2 . Pn)  *nd  that  the  Xj'a 

•re  mutually  Independent. 


DEFINITION  The  reliability  funetion  of  the  coherent  structure  (C,  ♦)  la 

h(£)  -  P(4(X)  -  1  I  £j  • 

It  la  easy  to  show  that  h(£.)  la  an  Increasing  function  of  £  for  coherent 
structures. 

We  now  wish  to  study  the  random  tim  at  which  a  coherent  structure  fails  as 

distinct  from  describing  ii:s  condition  at  a  apaaifitd  time.  To  do  this,  let 

T.,  T_,  ....  T  denote  the  failure  times  of  components.  The  reliability  of  the 
12  n 

1th  component  at  time  t  la 

f4(o  -  ^  .  4  • 

let 


x1(t) 


1  if  Tt  >  t 
0  otherwise 


end  let  T  be  the  time  to  failure  of  ths  structure.  Then  T  >  r  if  sad  only  if 
♦(XU ) ]  •  l  where  X(t)  •  (X^(t),  Xj(t),  ...,  XB<t))  .  The  reliability  at  tlms 
t  of  ths  structure  is 

?<t)  •  1  -  F(e)  •  F(T  >  t)  -  -  l) 

•  htftOl 

where  £(t)  •  (fj(t),  Fj(t) . JB(t))  * 

Suppose  we  build  a  coherent  structure  from  stochastically  independent  com¬ 
ponents  whose  failure  times  follow  sn  exponent la1  law;  l.s., 
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(t )  »  1  -  exp(-Ajt)  for  A^  ,  t  >  0  . 


If  tha  atructure  la  a  aarlea  structure  than  tha  llfetiae  of  the  atructura,  T  , 
again  baa  an  azponantial  dlatribution.  Howavar,  In  tha  parallal  caaa  it  ia  aaay  to 
verify  that  T  ia  not  exponentially  dlatributed.  What  can  wa  aay  in  general  about 
tha  propertlea  of  tha  dlatribution  of  T  ?  Birnbaua,  Baary  and  Harahall  (1966) 
have  actually  eharccterlaed  thia  claaa  of  diatributiona.  It  la  perhapa  aaalaat  to 
daacriba  this  claaa  in  tana  of  tha  failure  rata  function.  Let  F  ba  a  dlatribution 
on  (0,  ■>)  with  danalty  f  and 

r(t)  •  x  "ffiy  for  t  >  0  . 


Than,  intuitively,  r(t)dt  ia  tha  conditional  probability  of  failure  in  (t,  t  +  dt) 
given  aurvival  to  tine  t  .  Note  that 


?<t) 


DEFIMITIOK  A  dlatribution  F  auch  that  F(0)  •  0  la  called  JFRA  (for  incraaalng 
failure  rata  average)  if  and  only  if 


logf(t)/t 


la  nondacreaalng  in  t  >  0  . 

If  F  hea  a  danalty  f  ,  than  it  ia  aaay  to  verify  that  F  la  IFIA  iff 
r (u)du  la  noodacraaalng  ia  c  >  0  .  Note  that  exponential  diatributiona  are 
IFRA. 

THOMH  It  (Utnbeua,  laary  and  Harahall).  The  IFIA  claaa  of  diatributiona  ia 
cloned  under  tha  foraatloe  of  coherent  atructurea.  Fur tha more,  tha  cloaura  under 
coherent  atructurea  of  tha  exponential  claaa  of  diatributiona  la  danaa  in  tha  IFIA 
claaa  with  raapact  to  linita  in  dlatribution;  i.e.. 
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{ IFRA}CS  -  (IFRA)  {•jrp)CS*  10  . 

We  omic  Che  proof  Co  chle  thaoren.  The  key  to  the  proof  of  this  theorea  la  ea 
inequality  which  is  of  independent  interest. 

THEOREM  2:  (Blrnbeum,  Eeery  end  Marshall).  If  h  le  the  reliability  function  of 
e  coherent  structure  end  it  is  defined  on  [0,  1]  by  either 

(1)  <«(u)  -  -u  log  u 

(ii)  *(u)  •  -(1  -  u)  log  (1  -  u)  (the  dual  of  (1)) 
or  (lit)  *(u)  ■  u(i  ~  u) 


than  the  inequality 

n  3h(j>) 

(!)  £  Mpi)  i 

i«l  rl 

holds  for  all  £  vectoie. 

1(  p  *  p,  •  ...  -  p  «  p  end  *(u)  •  u(l  -  u)  then 
i  1  n 

im.f 

l-l  »»l 

and  (1)  hccoacs 

PR  ^dp*^  *  *  h(p)} 


for  0  <  p  »  1  .  (The  strict  inequality  can  be  shown  for  p  In  the  open  interval 


using  the  fact  that  all  components  are  aeeuaed  essential.)  If  h(p  )  •  p  ,  then 


h'i>0) 


h(Mll  -  mm! 


m1 - *.) 


-  -  1  : 


i.e.,  at  a  crossing  point  of  h  (p)  J  p  by  h(p)  we  eee  that  h(p)  le  increeeing 


end  hee  slope  >  1  so  that  we  have  the  situation  in  Flgur.  1. 


FIGURE  1 


Since  h(p)  Is  Increasing  It  can  cross  h(p)  ■  p  at  most  once  and  from  below. 
The  usefulness  of  this  result  follows  from  the  fact  that  a  redundant  structure 
with  reliability  function  h(p)  will  have  higher  reliability  than  a  single  com¬ 
ponent  for  component  reliability  p  >  pQ  .  This  result  was  first  discovered  by 
Moore  and  Shannon  (1956)  for  two  terminal  networks. 

PROOF  OF  THEOREM  2: 

The  proof  is  by  induction.  For  n  »  1  ,  $(x)  -  x  with  h(p)  ■  p  and  we 
have  equality  in  (1). 

Now  we  assume  the  theorem  is  true  for  n  -  1  .  We  claim  (1)  is  true  for 
h(l„*  £.)  •  Either  $(1Q,  i)  coherent  or  $(ln*  i)  =  1  •  In  either  case 
h(ln,  j>j  satisfies  (1). 

We  claim  (1)  is  true  for  h(0n,  £.)  •  Either  ${0Q,  *,)  ie  coherent  or 
*(0n,  j)  =  0,  In  either  case,  hjO^,  p.)  satisfies  (1). 

Now  *(x)  -  xn*(ln.  i)  +  (1  -  xn)*(°n*  i)  *nd 


■*  * 

0 


Also 


Hence 


(2) 


h<£)  "  E[*<X)]  -  pnh(ln,  £)  +  (l  -  Pn)h(On,  £)  . 


3h  (£) 

15“  "  h(1n*  2)  -  b(0B,  £)  . 


“  3h<£)  n^l  3h(£) 


£  "5T  *,p‘i'  £  -Sf  ♦fit 

♦  !»(v  *)  -  mv  £)|*(pj  • 

Now  we  substitute  into  (2)  using 

»(£)  •  P„h(ln.  £)♦(!-  pjh(0  e) 


so  that 


+  (1  •  p»>  £  *w 

*  Nv  *>  -  h(V  2)|*(P„)  • 

By  the  induction  hypothesis,  this  is 

-  ViH1,'  e.)|+  (i  -  p„)»[h(on,  £)| 

+  |MV  e)  -  MV  *)!♦(».)  ■ 

will  be  done  if  we  can  show 

pn*Ih(V  *)l+  I1  -  Pn)Hh(°„*  *)| 

*  lh(V  z)  -  h(°„*  £)|  *(Pn)  >  <<[h(£)) 
Ut  r  "  pn  *  hl  -  h(V  p)  and  h(°„.  £)  -  h  •  To  show 
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r+(hl)  +  (1  '  ^)*(h0)  +  [hx  -  hj*(r) 

>  <(/ |rh^  +  (1  -  r)hQ]  , 

l.t.,  to  show 

♦K  +  a  "  r)hol  "  *(ho) 

<  r*^)  -  r*(h0)+  (hx  -  ho)*(r)  . 

TO  SHOW  (i):  Let  $(u)  ■  -u  log  u  .  We  claim 

r*(hi)  -  r*(ho)  +  (hl  -  h0)*(r)  *  *(Thl)  -  *(rho)  • 

Substituting  in  for  $(u)  it  is  obvious.  Hence  we  need  only  show 

♦|rhl  +  (1  “  r)ho|  -  *[ho|  <  *[rhj  -  *[rhj  . 

This  is  geometrically  obvious  from  the  concavity  of  <|i(u)  »  -u  log  u  . 

TO  SHOW  (lil):  Let  *(u)  *  u(l  -  u).  We  need  only  show 

^(1  -  hj)  4  <1  -  r)h0jl  -  h0)  ♦(hj  -  hQ)r(l  -  r) 

.  (rtj  +  (1  -  DhJ  [l  -  rhj  -  (l  -  t)hj 

or 

2 

-rh2  -  (1  «  r)h2  +  (hj  -  hjr(l  -  r)  >  -  [rhj  +  (1  -  r)ho| 
or 

r^h?  ♦  2r(l  -  r)h  h,  «■  (1  -  r)2h2  -  rh2  -  <1  -  r)h2 
1  0  1  0  1  o 

♦  (hj  -  ho)r(l  -  r)  >  0 

hf  ♦  2h  h,  -  h2  ♦  h.  -  h  >  0 

1  0  1  O  1  o  • 


or 
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or 

(hi  •  h„) :° 

which  is  obvious.// 


Bounds  on  Failure  Distributions 

Classes  of  Failure  Distributions.  The  IFRA  Failure  distributions  Mentioned 
earlier  are  theoretically  attractive  because  of  their  closure  property  with 
respect  to  coherent  structures.  They  also  possess  an  interesting  graphical 
property  which  is  useful  in  theoretical  investigations.  Let 


C(x) 


1  -  e"X  for  x  >  0 
0  otherwise. 


Then  the  graph  of 


-logfl  -  F(x) ]  -  G~*F(x) 


is  starshaped  with  respect  to  the  origin  for  x  >  0  ;  i.e., 
implies  that  the  "upper  side"  of  every  point  on  the  graph  of 
"visible"  from  the  origin.  Figure  2  is  an  illustration  of  a 
with  respect  to  the  origin  which  is  not  convex. 


Silsi.  i.  „>o 

x 

C_1F(x)  is 
starshaped  function 
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FIGURE  2. 

Not*  chat  G_1F(x)  la  convex  for  x  >  0  if  and  only  if  it  is  starshaped  with 
respect  to  every  point  on  its  graph. 

In  replacement  policy  problems  especially,  one  is  often  concerned  with  the 
conditional  failura  distribution  given  survival  to  time  t  ;  i.e., 

F  (x)  »  P{X  >  t  +  x  |  X  >  t}  •  F-&  . 

*  F(t) 

It  is  mathematically  convenient,  and  also  intuitively  plausible  in  soma  situations, 
that  the  conditional  failure  distribution  of  an  assembly  should  also  exhibit 
some  "was rout"  characteristic  if  the  original  failure  distribution  exhibits  such 
a  characteristic.  It  is,  however,  easy  to  provide  examples  of  IFRA  distributions 
which  are  not  only  not  conditionally  IFRA  but  are  in  fact  conditionally  DFRA  for 
some  t  >  0  .  Hence  we  may  ask  the  question:  Vhat  is  th$  largott  olato  of 
< attributions  in  tho  IFRA  olaee  whioh  remain  IFRA  upon  oonditioning  on  tho  loft? 


That  is  we  want 


X  X 


nondecreasing  in  x  >  0  for  every  t  >  0  .  Clearly  this  will  be  true  only  if 
the  graph  of  -logf(x)  is  starshaped  with  respect  to  every  point  on  the  graph. 

It  follows  that  -logF(x)  is  convex  for  x  >  0  .  If  F  has  a  density  f  , 
then  the  failure  rate  function 

r(x)  .  -Jlxl- 

KX}  1  -  F(x) 

must  be  nondecreasing  in  x  >  0  .  We  call  this  class  of  distributions  the  IFR 
class,  for  increasing  failure  rate. 

As  we  have  seen,  the  exponential  distribution  provides  the  basis  for  an 
interesting  hierarchy  of  failure  distributions.  The  representation  in  Figure  3, 
suggested  by  James  Esary,  emphasizes  the  central  role  of  the  exponential  dlatribu* 
tion.  Special  classes  noted  in  the  figure  are  the  Weibull  class  with  densities 

f(t)  •  aXta  1e"^t  for  a  ,  X  ,  t  >  0 
end  the  gamma  class  with  densities 

;  (t)  -  X  *”Xt  for  »  ,  X  ,  t  >  0 
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The  No-Data  Problem.  In  Che  aerospace  and  electronics  industries  one  of  e 
kind  assemblies  sre  common.  For  such  assemblies  no  life  testing  results  are 
available.  All  that  is  svail&hle  in  many  cases  is  che  engineers'  past  experience 
with  similar  components.  Contractual  obligations  however  often  require  a  re¬ 
liability  statement  -  also  warranties  require  a  reliability  assessment;  e.g.t  It 
may  be  required  that  the  assembly  operate  properly  for  1000  hours  with  probability 
.99. 


The  mean  life  of  an  assembly  le  a  concept  with  which  all  laymen  are  familiar 
and  engineers  will  make  statements  in  terms  of  mean  life  far  sooner  than  they  will 
make  a  probability  statement.  Hence,  even  in  the  absence  of  data,  engineers  will 
often  estimate  the  mean  life  for  a  new  piece  of  equipment  based  on  past  experience. 
We  can  translate  this  mean  life  statement  Into  a  conservative  probability  state¬ 
ment  using  bounds  based  only  on  intuitively  reasonable  assumptions.  Mors  gmnerally, 
given  an  rth  moment  we  can  state  the  following  result  (see  Barlow  and  Marshall 


(1964)) : 

THEOREM  3:  If  F  is  IFR,  F(0)  •  0  ,  r  >  1  and  u 


1  -  F(t) 


r  ±1  0  a 

*  *xpj-t/(*r)r j  t  «  Mfr 


dF(x)  ,  then 


U) 


>  0 


t  >  u 


where  •  ur/T(r  +  1)  .  This  inequality  is  sharp. 


PROOF 


We  con  actually  prove  a  more  general  but  lees  motivated  result  and  the  proof 
ie  easy.  Suppose  F  sod  G  srs  any  two  continuous  distributions  satisfying 

F(0)  »  G(0)  •  0  ,  C"lF(x)  is  convex  for  x  >  0  end  J%Tdt( x)  • J  *TdC(x)  •  a* 


for  r  »  1  .  Let  X(Y)  have  distribution  F  (G)  and  Xr(Yr)  have  distribution 
Fp  (Gf)  .  We  claim  G^F^x)  is  convex  in  x  >  0  .  Note 
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3^Fr<x)  - 


and  assuming  dlfferentlabllty 


4"  G_1F  (*)  -  r 
dx  r  r 


r-1 


(g-1f) 


(«') . 


The  first  factor  in  increasing  in  x  since  G~^F  is  starshaped  and  r  >  1  .  The 
second  factor  is  increasing  in  x  since  G  is  convex. 

Row  let  X^,  X^,  ....  X;  ...»  Y*  j  denote  a  random  sample  from 


F_ (G_)  .  Then  since  G  r  is  convex 


r  r 


r  r 


G_1F 
r  r 


ax 


- :  t,  c',l'rK) 


and 


L  i 

J 

i-1 

n 

-e 

n 

F^ 

» 

2X 

<  G 

-  i 

ilXSKi 

L  i  J 

.  1 

•  G 

st  r 


J2X 

1 


where  denotes  stochastic  equality.  Letting  n  +  »  ,  we  have  by  the  strong 
law  of  large  numbers 


W  :  W 


or 


F(ur)r  «  C(ur)r 


Since  F  crosses  G  at  most  once  and  from  below  if  at  all,  we  have 

1 

F(t)  <  C(t)  for  t  <  u  r  . 


-  r 


Letting  C(t) 


r  Ai 

•  1  -  expl-t/*  r  I  for  t  >  0  we 
w  r  J 


easily  see  that 


/tfdG(t,  -  uf 
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and  (1)  la  Immediate.  Since  F  IFR  allows  the  possibility  of  a  jump  at  the  right 
end  of  Its  Interval  of  support,  the  proof  is  completed  using  limiting  arguments. 

1  A 

The  bound  for  t  >  p  r  is  attained  by  the  d.istribu:  ttion  degenerate  at  ufT  , 

which  Is  a  limit  <n  IFR  distributions.// 

I 

Since  it  is  well  known  that  (u^)r  18  always  nondecreasing  in  r  >  0  for 

distributions  on  the  positive  axis,  we  see  that  higher  moments  enable  us  to  obtain 

1 

nontrivial  bounds  over  a  greater  range.  To  prove  that  (u  )r  ia  nondecreasing  in 

r  r! 

r  >  0  for  distributions  on  the  positive  axis,  let  $(x)  ■  xr  where  r  <  r'  . 

Then  $  is  convex  for  x  >  0  and 


*  [i$<]  • 


[&4  -  [i^f  • 


where  X^,  . is  a  random  sample  from  a  distribution  F  such  that 


F(0  )  •  0  and  i» 


'j'' 


dF(x)  .  Applying  the  strong  law  of  large  numbers  we  have 


Cw  )F  «  (v  .)r 
r  •*  r 


for  r  <  r'  . 


Unfortunately  the  nontrivial  part  of  the  bound  In  (1)  la  decreasing  in  r  »  1 

A  f  i  A 

This  follows  from  the  fact  that  for  IFRA  distributions  1^*  •  1  f (g  ♦  i) 1  ** 
decreasing  in  r  >  0  .  or  equivalently,  -logi  is  starshaped  for  r  >  0  .  It  Is 
interesting  that  for  1  FRA  and  IFR  distributions,  the  geometrical  properties  of 
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F(x)  are  inherited  by  the  "normalized"  moments  X^  ■  \i^/T( r  +  1)  ;  i.e,,  if 
-logF(x)  is  starshaped  (convex)  in  x  >  0  ,  then  -log.\r  is  starshaped  (convex) 
in  r  >  0  . 


»  i  — 

i  4:  If  F  is  I  FRA  and  yf  xrdF(x)  then  is 


THEOREM 


noninr.reasing  in  r  >  0  . 


PROOF 


Let  s  <  t  and  note 


"•  ‘  ‘P1 
0 
00 

f  s-1 

■six 


F(x)dx 


/  s-1  -x/X  “  . 
six  e  s  dx  . 


s— s— 1  — x/A 

Since  F  is  IFRA,  x  F(x)  crosses  x  e  s  exactly  once  and  from  above, 
say  at  xq  .  Hence  if  $  is  an  increasing  function 


J' <j>(x)xs  *F(x)dx  - J" 4>(x)xS  ^e  x^s  dx 

°  r 0  fi 

-  /  [*(x)  -  $(xo)][xS"1F(x)  -  xS~Vx/AsJdx  <  0  . 


Let  (x)  ■  x  .  Then 


00  00  f”  ^1 

s  /  t-s f  s-1-,  0  ,  /  t-s  s-1  -x/X  8  L 

—  p  "lx  sx  F(x)  dx  <  I  x  |sx  e  s  Jd> 

4 » • 


or  (Xfc) C  <  (^8)8  which  was  to  be  proved.// 


Additional  probability  bounds  may  be  found  in  Barlow  and  Marshall  (1964), 


(1965)  and  (1966). 
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1.  Introduction 

We  are  concerned  with  the  optimal  control  of 
certain  types  of  dynamic  systems.  We  assume  such  a  system 
is  observed  periodically  at  times  t*0,l,...  .  After  each 

observation  the  system  is  classified  into  one  of  a  possible 
number  of  states.  Let  I  denote  the  space  of  possible 
states.  I  will  be  assumed  to  be  either  finite  or  denumerable. 
After  each  classification  one  of  a  possible  number  of 
decisions  is  implemented.  Let  denote  the  number  of 
possible  decisions  when  the  system  is  in  state  i,  i  6  X. 

The  sequence  of  implemented  decisions  interactewith  the 
chance  environment  to  effect  the  evolution  of  the  system. 

This  research  was  supported  by  the  Army,  Navy.  Air  Force 
and  MAfA  under  a  contract  administered  by  the  Office  of 
Naval  Research)  Contract  Moor  266(55)  ”  MR-042-099*  Re¬ 
production  in  whole  or  in  part  is  permitted  for  any 
purpose  of  the  United  ttstes  Government. 
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more  specifically  let 


Ut}. 


denote 


the  successive  observed . states  of  the  system.  Let 
{  At  },  t»0,l,...  denote  the  successive  decisions, 

ed  »f>wH^iew>4asdfc»  Assume  that  when  Y.  ■  i  and 

.  {  w..  } 


A«k  a  knows  cost  w^  is  incurred.  The  nuafcers 


ik 


may  be  expected  costs  rather  than  actual  costs.  In  such 
a  case  we  assume  a  distribution  of  costs  dependent  upon 
the  state  i  and  decision  k  from  which  the  expected  cost 
can  be  computed. 

A  rule  or  policy  R  for  controlling  the  system  is 
a  set  of  functions  {  (ht_1#  Yt)  },  t-0, . .  where 

ht  "  {  Y0'  A0 . Yt*  At  }'  ®k  ^  *nd  X  ®k  1 

k 


°k  (ht-l*  Yt^  ia  to  b*  interpreted  as  the  probability  of 
implementing  decision  k  at  time  t  given  the  "history" 
and  the  "present  state"  Yt-  Thus  a  rule  specifies,  at 
each  point  in  time,  a  chance  mechanism  to  be  used  in 
deciding  which  action  to  take.  The  rule  is  only  permitted 
to  depend  on  the  history  of  states  and  decisions. 

Given  a  rule  R  and  a  probability  distribution  over 
the  initial  state  Yq#  we  assuaw  the  sequence  (  Yfc,  At  } 
is  a  stochastic  process  defined  over  the  joints  space  of  Z 
and  the  possible  decisions.  Throughout  we  shall  assume  that 
P(Yq  •  i),  i  €  Z,  is  known,  moreover,  we  assua*  that  there 
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are  known  transition  probabilities  \  (k)  /  such  that 

for  every  i,j  and  k 


i, 


*  k 


} 


independent  of  t  and  hfc_^  In  words,  when,  at  any  tine  t, 
the  state  i  is  observed  and  decision  k  is  made,  then 
(k)  denotes  the  probability  that  the  system  will  be 
observed  in  state  J  at  time  t  +  1.  under  this  Utter 
assumption  we  refer  to  the  process  {  Yfc,  At  }  as  a  Markovian 

Decision  Process .  It  shall  be  esphasised,  however  that 
{  Yt*  At  }  la  not  necessarily  a  Markov  process.  For  When 
R  is  such  that  (h^,  Yt  )  is  a  function  of  h^.  the 
process  {  Yt,  ^  }  will  not  be  Markovian.  If,  however, 

Wl'  Yt*  iu  *  Action  of  Yt  and  t  for  every  t-0,1, ..., 
then  {  Yt  }  is  a  Markov  process,  And,  if  (\„v  Yt)  is 
only  a  function  of  Yt. for  every  t  then  {  Yt  }  is  a  Markov 

chain  with  stationary  transition  probabilities. 

^  »  t  *  0,  1,  ...  be  defined  ss  follows  i 

"t  *  "ik  1£  Yt  ’  4'  *  k*  **  1,...,^,!  €  X. 
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divan  a  policy  R  and  an  iniuai  state  yq  ■  i,  then  the 
sequence  {  Wfc  },  t  ■  0,  1,  ...»  is  a  stochastic  process. 
We  can  speak  of  the  expected  cost  at  time  t  as 


SRWt 


ll 

j  k 


j. 


where  ER  and  PR 
under  the  policy 
1.  *ik  )  are  such 


denotes  the  expectation  and  probability 
R.  We,  of  course,  assume  that  the  costs 
that  ER  Wfc  exists. 


Let 


(i) 


R.T 


i.e.  cpR -y(i )  is  the  expected  average  cost  incurred  by  the 
system  up  to  time  y  given  Yq  »  i  and  R  is  the  policy  con¬ 
trolling  the  system. 


Let 

< P  (i)  *  inf  op  (i) 

R  R«T 


Hit  JZIflfeltB  imdil  consideration  ia  this  exposition  i£  that 
JZ£  lidOiM  JLt £  minimise  cpR  (i)  . 

For  what  follows  it  is  convenient  to  consider 
thfSS  classes  of  policies.  The  first  is  the  class  of  all 
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policies  of  the  form  described.  That  is,  all  policies  which, 
at  each  point  in  time,  use  past  states  and  decisions  as  a 
basis  for  making  a  decision.  We  let  0  denote  this  class. 

The  second  class  is  the  class  which  uses,  at  each  point  in 
time,  the  state  of  the  system  it  that  instant  as  a  basis 
for  making  a  decision.  We  shall  refer  to  this  class  as 
the  class  of  stationary  Markovian  policies  and  denote  this 
class  by  C.  The  third  cljss  is  the  sub  class  of  C'  in 
which  the  policies  are  not  of  a  random  character.  We 
denote  this  class  by  CB.  A  policy  R  €  C"  may  be  thought 
of  as  a  function  defined  over  the  states  with  range  in  the 
set  of  possible  decisions;  to  each  state  there  corresponds 
a  unique  decision.  We  refer  to  QH  as  the  class  of  dstermlnAlfru 
stationary  Markovian  policies. 

We  shall  divide  the  following  discussion  into  .two 
parts.  One  for  the1  Case  where  I  is  finite;  theother,  for 
where  1  is  denumerable.  In  going  from  the  finite  to  the 
denumerably  infinite,  mathematical  questions  arise  which 
are  not  yet  settled. 

2.  Finite  Humber  of  States. 

The  fundamental  fact  concerning  the  probleei  at 
hand  for  this  finite  state  case  can  be  summarised  ss 
Theorem  it  Jf  X1  <  •  ,  i  €  I,  afl£  I  if  finite,  then  there 
.  policy  R  €  c-  aHi£h  flinHius  i  €  I. 
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The  above  theorem  ha a  been  more  or  lees,  proved,  by  aeveral 
Authors  (See  [1],  [2],  [3],  [4]).  Each  prcof  relies  on 
what  we  refer  to  as 

Theorem  2.  If  <  •,  i  €  X,  and  1  is  finite,  then  there 
exists  a,  policy  Ra  €  C"  which  minimises 

♦*  (i.a)  *  l  *„  "t  .  t  €  I 

where  a  is  a  given  number  between  sero  and  one. 

♦R  (i,a)  if  often  of  economic  relevance.  It  is  referred  to 
as  the  expected  discounted  (with  discount  factor  a)  cost 
criterion. 

Theorem  2  is  usually  taken  as  self-evident.  However,  for 
proof  see  [3]  end  [6].  On  letting  a  -  1  end  using  an 
appropriate  Tauberian  theorem  £e.g.  l£gi  (1-a)  fR(i,a)  «  cpR(i) 

»h.n  R  t.  .uch  th.t  »R  (i)  -  )  th.arM  j  e.„ 


established. 
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Theorem  1  carries  with  it  two  advantages.  ltoe 
class  C"  contains  only  s  finite  number  of  policies  'even 
though  the  number  msy  be  astronomically  large)  and  under 
any  R  €  C"  ,  (  Yt  }  is  a  Markov  chain  with  stationary 

transition  probabilities.  As  a  result,  finite  algorithms 
for  minimising  qpR  (i)  over  R  €  C*  are  obtainable. 

Two  methods  for  obtaining  the  optimal  R  €  C" 
have  been  advanced.  One  method  involves  linear  programming, 
(see  (7],  (3]  ).  The  other  is  a  derivste  of  dynamic  pro¬ 
gramming  (see  [Qb  [2]).  lie  indicate  the  latter  method 
firat. 

_  -c* 

let  R  ♦llPC*  be  an  arbitrary  policy^  ,  it  can.  be 
shown  that  there  exists  a  unique  set  of  numbers 

{  gj  .  v*  }  ,  i  €  I  satisfying 

i 

(1)  *u  ♦  f  fu  (*)»*•  i  « *• 

i 

and 

(2)  I  ® ?1  *  0  .  i  <  I- 

3 


where  wiR  and  *^(r) 
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l  * 


denote  the  cost  and  transition  probability  (k) 

involvad  at  atata  i  undar  policy  R;  is  tha  limiting 
(aa  T-*»)  expected  proportion  of  tima  that  tha  systam  is  in 


atata  j  givan  it  is  initially  in  atata  i.  Th«  thaory  of 
Markov  chains  indicatas  how  tha  limiting  valueit  {n*^} 
can  ba  obtainad. 


Ona  can  also,  aaa  that 


V  n* 

1  11  ij 
j€l 


"i*  ■  ** w  ■ 


For  aach  i  €  X.  lat  B, 


dacisions  for  which  aithar  (a) 
A 


danota  tha  sat  of 


X  to  (<0  «,*  <  9/ 

j 


or  (b) 

X  fi)  00  *  n* 

1 


•it.  ♦  X  Hi  00  <  SiR  *  . 

Jafina  a  policy  R*  as  followas  For  at  laast  ona  i  ouch 
that  ia  non-empty  praacriba  a  decision  in  whan  in 
atata  i.  For  all  atataa  i  whara  »A  is  empty  or  where  one 


does  not  prescribe  a  decision  in  ,  make  the  decision 
dictated  by  policy  R.  We  shall  call  the  mapping  from 
R  to  R’  a  policy  iteration. 

It  can  be  shown  that  qpRI  (i)  £  qpR  (i)  *  i  €  I. 
The equality  may  hold  because  of  the  possible  presence  of 
transient  states  in  the  Markov  chain  associated  with  policy 
R' .  Thus  the  policy  improvement  procedure  starts  with  an 
arbitrary  policy  Ro  and  carries  out  successive  policy 
iterations  until  no  more  can  be  made.  Since  there  are  only 
a  finite  number  of  policies  in  C"  this  stage  must  be 
reached.  (Actually,  this  is  not  obvious  because  of  the 
possibility  that  <pRl(i)  «  «pR( i )  ,  i  €  I*  That  is,  an 
argument  must  be  made  to  show  that  cycles  of  policies  will 
not  occur. )  At  the  termination  of  the  successive  policy 
iterations  we  must  have,  if  R»  is  the  terminal  policy . 

% 

(3)  9**  ♦  vf  -  .in  {wu  +  T  vj*},  i  « 

I 


l  1  <  * 


.  i  €  I. 
A 


Prom  thAVe  e&uetleaa  it  can  be  shown  that  R*  is  optimal. 


Thus,  the  sequence  of  policy  iterations  terminates  at  an 
optimal  policy  within  a  finite  number  of  iterations. 
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The  policy  improvement  procedure  simplifies 
under  the  assumption 

(a)  I  contains  e£  at  most  one  ergodic  class 
of  states  for  every  R  €  C". 

a.  ^ 

Under  (A^g^  "  independent  of  the  initial  state  i, 

part  (a)  of  the  definition  of  is  unnecessary,  and 
(4)  of  the  terminal  will  always  hold. 


Let  <p£°)  (i) 

manner  similar  to  cpR(i) 


m*l,...,Mbe  defined  in  a 
except  that  cp£m)  (i)  !• 


defined  with  respect  to  costs  |  w(™^  }'  m?«l,...,M. 

Although  the  policy  improvement  procedure  is  powerful 
enough  to  obtain  an  optimal  solution  to  the  problem  under 
discussion^  v.nder  general  conditions  (provided  the  number 
of  states  is  not  too  large) ^ it  does  not  provide  an  algorithm 
for  soving  the  more  complicated  problem: 


Minimize  cp  (i) 

K 


subject  to 


(m) 

(i) 


m  *  1, . . .  ,M 


where  b  ,  m  *  1,...,M  are  given  constants 
m 


When  (A)  holds  the  method  of  linear  programming 
is  effective  for  obtaining  an  optimal  policy.  Under  (A) 
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'  II  Xik  wik 

i  k 

subject  to  Xjjc  >  0  k*l#...,K^  ,  i  €  I 

l  Xjk  -  l  l  Xik  fcj  W  -  O'  1  «  1 

k  i  k 


j  * 


I  I  Xik  Wik^  *  bm  '  .  . 

If  {  x*^  ]■  ie  a  solution  to  the  above  problem  then  the 
optimal  policy  R*  €  C1  is  defined  by  setting 

Dik  ■  **  —  if  I  **tk  >  0  - 

I-  * 

and  arbitrary  if  £  x#ik  "  ®  * 

k 


If  M  ■  0,  i.e.  no  additional  conatrainta  ara 
imposed,  tha  mathod  ia  an  altarnativa  to  tha  policy 
itaration  procadura  undar  condition  (A). 

Tha  question  of  how  to  aolva  tha  problem  with 
additional  conatrainta  without  assuming  (A)  remains  open. 

In  general,  an  optimal  policy  naad  not  axiat  in  C*.  One 
can  assert.  howavar ,  (saa  [9]) that  undar  (a)  an  optimal 
policy  willVexist  in  C. 

It  ahould  ba  pointad  out  that  tha  aolution  to 
tha  problem  of  minimising  qpR(i)  naad  not  ba  unique.  Tha 
quaation  ariaaa  as  to  whathar  soaw  solutions  might  not  ba 
bat tar  than  othara.  That  is.  othar  criteridh,  not 
explicitly  put  into  tha  problam.  may.  in  part,  ba  relevant. 
For  axampla.  it  has  bean  shown  (saa  [2]  £.  that  there 
exists  a  policy  R*  €  c"  such  that  fR  (i.a)  is  minimised 
for  all  a  and  near  enough  to  1.  Whan  this  ia  tha  case 
R*  also  minimises  $R(i).  Vo  computational  procadura  has 
yat  bean  given  to  find  such  a  policy.  However,  a 
procadura  (saa  [10 3) has  bean  given  for  finding  a  policy 
R**  having  tha  property  that 

4S!  [*R*  “  ♦rss  (*'*)  ]  •  0  . 

Heedless  to  say,  R**  also  minimises  qR(i). 
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3*  Denumerable  State  Case 

Let  us  turn  from  tha  finite  caaa  to  tha  case 
where  Z  it  danumarably  infinita  and  ask  ouraslvas 
whether  tha  basic  facts  as  put  forth  in  theorems  1  and  2 
remains  tha  same. 

Tha  following  modification  of  theorem  2  can  ba 
shown  (saa  [33  and  [6]  ). 

Theorem  2*.  If  <  •  .  i  €  I.  and  ■[  wilt  }  bounded, 
than  thara  exists  a  policy  R&  (  C"  which  minimises 
♦*  (i.«)  ,  1  €  I. 

If  Kj  ■  •  it  can  ba  easily  shown  that  thaoram  2* 
need  not  hold.  If  tha  {  J  are  not  bounded  it  can 
also  ba  shown  that  tha  result  does  not  hold  in  general 
(saa  [63).  Thus,  wa  might  ask  if  tha  conclusions  of 
thaoram  1  hold  under  tha  conditions  of  thaoram  2 1 . 

An  example  [113  has  bean  given  showing  that  tha 
conditions  of  thaoram  2*  do  not  guarantee  tha  conclusions 
of  theorem  1.  Moreover,  tha  example  shows  that  an  optimal 
policy  R  for  minimising  (i)  may  not  exist.  This 
counter-example  also  implies  that  thara  may  not  ba  a  policy 
R  €  C"  which  minimises  (i.a'  for  all  a  near  enough 
to  1.  as  in  tha  case  whan  I  is  finite.  For  if  such  ware 
tha  case,  it  would  ba  possible  to  show  that  gR  (i)  could 
always  ba  optimised. 
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A  more  surprising  counter-example  (see  (123  ) 
shows  that  an  optimal  policy  may  not  exist  in  C";  but  it 
may  in  C.  Thus,  if  restricted  to  stationary  policies,  a 
policy  involving  randomisation  may  prove  to  be  more 
effective  than  one  that  is  deterministic.  The  counter-example 
exploits  the  fact  that  denumerable  state  Markov  chains  may 
have  recurrent  null  states— a  property  denied  finite  state 
chains. 

An  even  more  surprising  counter-example  (see  [13]  } 
shows  that  cpR  (i)  may  be  minimised  by  a  policy  in 
C-C',  whereas  no  optimal  policy  exists  in  C'.  Thus,  in  order 
to  obtain  an  optimal  policy  one  may  find  it  necessary  to  go 
beyond  the  class  of  stationary  Markovian  policies.  This 
fact  seems  to  run  counter  to  one's  inflation  regarding  the 
problem  under  discussion.  The  literature  has  numerous 
remarks  asserting  the  reasonableness  of  assuming  that  an' 
optimal  policy  is  stationary. 

Mhat  develops  as  an  interesting  mathematical 
question  is  that  of  determining  the  weakest  conditions  under 
which  it  can  be  asserted  that  a  policy  R  €  C"  is  optimal. 

Xn  (12]  and  [143the  following  was  proved. 

Theorem  3-  »up£ose  {  wiJt  !  m  tainflffl-  JX  &t£ft  Wilt! 

bounds^  g,  ( V ^  },  j  €  X, 

satisfying 


(5) 


g  ♦  v. 


min 

Jk 


{ 


'ik 


+  f  «ij  W  vj  i « *• 


V 


5 

| 


i 
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than  thara  axiata  a  policy  R*  €  C"  which  ia  optimal.  The 
policy  R*  ia*  implement  deciaion  k  «  jft^  which  minimiaea 
the  right  aide  of  (5)  for  each  i  6  X.  Also. 


g  -  tPn*  (i)»  i  €  1. 

Note  that  (5)  ia  related  to  (l)  with  gi  *  g. 
Actually,  theorem  3  can  be  generalized  to  the  case  where 
the  g^.jy  are  not  all  equal  and  (l)  and  (2)  replace  (3). 

Conditions  implying  the  hypothesis  of  theorem  3 
can  be  given.  The  approach  ia  to  define-  a  policy  improve¬ 
ment  procedure  similar  to  the  one  discussed  in  the  finite 
state  case  and  show  that  in  the  limit  one  gets  a  policy 
R€  C"  satisfying  (3). 

We  define  a  policy  iteration  as  follows: 

Let  R  €  C"  be  gxven.  Assume  a  solution  gR,  {  v^R  },  j  €  I 

to  the  system  of  equations 

(6)  «R  +  Vj*  -  wiR  +  1  <ji:)(t}  VjR,  i  €  1 


exists.  (The  system  (6)  is  treated  in  [13]*) 
by  choosiug,  for  each  i  €  I.  that  decision  jfc  • 


minimises 


wik 


+  l 


j 


R 


Define  R1 
4^  which 


The  transformation  from  R  to  R'  is  the  policy  iteration. 
Note  that  the  policy  iteration,  here,  is  defined  more 
stringently  than  for  the  finite  state  case.  By  a  policy 


4  * 
*  * 
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improvement  procedure  we  mean  starting  with  an  arbitrary 
policy  Rq  €  C",  letting  Rn+1  denote  the  policy  obtained 
by  a  policy  iteration  on  Rn  ,  n  -  0,1,...  .  If,  for  any 

n,  Rn+1  *  Rfl  ,  then  equations  (5)  are  satisfied  and 
Rn  is  optimal.  Otherwise,  one  may  or  may  not  obtain  an 
optimal  policy  as  n-*».  It  is  of  interest  to  provide  con¬ 
ditions  under  which  {  Rn  }  converges  to  an  optimal  policy. 
Let  us  list  the  following  conditions. 

(b)  For  every  R  €C'‘,  the  associated  Markov  chain  is 
irreducible  and  positive  recurrent. 

(c)  For  every  R  €  C'*  there  exists  a  bounded  solution 

gR,  ^vjR  j  €  I  to  (6).  The  solutions  are 
uniformly  bounded  over  R  €  C*1. 

(d)  For  every  i  €  I,  inf  U.R  >  0  • 

R6C' '  1 

can  assert  (see  [1$]  ) 

Theorem  4.  If  <  e,  i  €  I,  (w^  }  ai£  bounded,  and 

(b),  (c),  and  (d)  hold,  tfras  bsUbl  towammi 
procedure  converges  to  an  optimal  policy  R*  €  C' ' . 

The  proof  of  theorem  4  involves  showing  that 
the  limiting  policy  does  yield  a  solution  to  (5).  Under 
weaker  conditions,  i.e.  (b)  and  (c),  it  can  be  shown 
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that  a  solution  to  (5)  axiata.  Therefore,  an  optimal 
policy  ia  in  C".  However,  it  ia  not  claar  that  a  policy 
improvement  procedure  will  convarga  to  an  optimal  policy. 

Conditions  ara  givan  in  [15]  guaranteeing  (c); 
slightly  weaker  conditions  may  ba  givan  in  [131 -  A  batter 
approach  to  tha  axiatanca  question  would,  in  all  likelihood, 
avoid  the  aquations  (5)  and  (6). 
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ABSTRACT 


A  broad  mathematical  framework  la  considered  that  includes 
stochastic  learning  Models  with  distance  diminishing  operators 
for  experiments  with  finite  numbers  of  responses  and  simple  con¬ 
tingent  reinforcement.  Convergence  theorems  are  presented  that 
sub sun  most  previous  results  about  such  models,  and  extend  them 
in  a  variety  of  ways.  These  theorems  permit,  for  example,  the 
first  treatment  of  the  asymptotic  behavior  of  the  general  linear 
model  with  experimenter-subject  controlled  events  and  no  absorbing 

I 

barriers.  8oms  new  results  are  also  given  for  certain  two-process 
discrimination  learning  models  snd  for  models  with  finite  state 
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1.  INTRODUCTION 


Suppose  that  a  subject  is  repeatedly  exposed  to  an 

experimental  situation  in  which  various  responses  are  possible, 

or  trial 

and  suppose  that  each  such  exposure ^can  alter  the  subject* • 

response  tendencies  in  the  situation.  It  is  assumed  that  the 

subject's  response  tendencies  on  trial  n 

are  determined  by  his  state  Sn  at  that  time.  The  eet  of 

possible  states  is  denoted  s  and  called  the  state  space.  The 

effect  of  the  nth  trial  is  represented  by  the  occurrence  of  a 

certain  event  B  •  The  set  of  possible  events  is  denoted 

"""  n 

B  and  referred  **  the  event  space.  The  quantities  s  and  B  are 

_____  n  n 

to  be  considered  random  variables.  The  corresponding  small  letters 

s  and  e  are  used  to  indicate  particular  values  of  these  variables, 
n  n 

and,  in  general,  s  and  e  denote  elements  of  the  state  and  event 
spaces,  respectively. 

To  represent  the  fact  that  the  occurrence  of  an  event  effects 

a  change  of  state,  with  each  event  e  is  associated  a  mapping  f^(*) 

of  s  into  s  such  that,  if  8  *  e  and  s  »  s,  then  S_ A ,  •  f  (s). 

n  n  n+i  e 

Thus 


"l  Vx  ■  \  <s„> 


for  n  a  1.  The  function  f^(«)  will  be  called  the  operator  for  the 
event  e  or  eimply  an  event  operator.  Throughout  the  paper  it  is 
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•MUMd  that 

It  la  further  ujppoaad  that  tha  learning  situation  ia  memory -leas 
pnd  temporally  homogeneous,  in  tha  sense  that  tha  probabilities 
of  tha  various  possible  avanta  on  trial  n  dapand  only  on  tha  state 
on  *rial  n ,  • nc  not  on  aarliar  states  or  events,  or  on  tha  trial 
nuafcer.  Tnat  is,  there  is  a  raal  valued  function  o.(«)  on  E  x  ' 
such  that 

H3  *  •j)  *  4U  (•)>  «nd 

p.(8n*i  "  -  •,»  1  *  j  *  n)  -  a  (f  (9)) 

a  n+1  n+1  j  j  •„+! 

for  n  i  1,  where 

*  .  (•)  *  *  (-“(f  (a)))).  (1.1) 

V’**n  *n  %-l  *1 

Throughout  the  paper  state  subscripts  on  probabilities  and  ex¬ 
pectations  are  initial  states,  that  is,  values  of  5^* 

Two  examples  will  be  discussed  in  Section  3*  a  linear  r.odel 
for  ordinary  two-choice  learning,  and  a  two-stage  linear  discrim¬ 
ination  l«  trning  model.  In  the  first  linear  model,  the 

etete  is  the  probability  of  one  of  the  responses,  so  s  •  [0,  1). 

In  tha  linear  discrimination  learning  model  the  state  is  e  pair 
of  probabilities  that  determine,  respectively,  the  "response" 
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probabilities  at  the  two  stages.  Thus,  g  *  [0,  1]  x  [0,  1]. 

In  these  exanples  each  event  involves  the  subject's  overt  response 
(suitably  coded),  the  observable  outcome  of  that  response  (i.e., 
the  experimenter's  response),  and,  sometimes,  a  hypothetical 
occurrence  that  is  not  directly  observable  (e.g.,  the  state  of 
attention  on  a  trial).  The  force  of  assunption  H3  for  the 
experimenter  is  to  limit  reinforcement 

schedules  to  those  in  which  the  outcome  probabilities  depend  only 
on  the  immediately  preceding  response,  that  is,  to  simple  contin¬ 
gent  schedules. 

The  research  reported  rn  this  paper  is  directed  toward 

understanding  the  asymptotic  behavior  of  the  stochastic  processes 

(s  )  and  (a  )  for  a  class  of  models  with  distance  diminishing 
n  n 

event  operators  to  be  defined  below  by  imposing  additional  restric¬ 
tions  on  the  functions  f  and  This  class  generalises  the  familiar 
linear  models,  and  the  latter  provide  much  of  the  motivation  for 
the  axioma  for  the  former. 

To  discuss  "distance  diminishing"  event  operators, 

it  is  necessary  to  assume  that  s  is  a  metric 
apace  with  respect  to  soma  metric  d.  A  formulation 

in  terms  of  Euclidean  space  and  root-sum-square 
distance  would  yield  sufficient  generality  to  cover  the  linear 
models  of  section  3.  such  a  formulation  would,  however,  restrict 
generality  without  any  redeesdng  simplification* 
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Nonovar,  a  treatment  in  tanu  of  geaeral  matric  spaces  highlights 

those  aspects  that  ara  crucial  to  tha  theory.  For  thasa  raaaons 
ably 

it  ia  assumed/\that 

B4  (8,  d)  is  a  metric  apaca. 

Tha  raadar  who  prafars  a  Euclidean  setting  can  easily 
specialize  most  of  what  follows  to  suit  his  preferences.  The  next 
assumption  is  suggested  by  the  linear  examples  of  section  3: 

H5  (S,  d)  is  compact. 

Tha  remaining  hypotheses  are  most  easily  stated  in  terms  of 
tha  following  notations.  If  V  and  g  are  mappings  of  s  into  the 
real  numbers  and  into  s,  respectively,  their  maximum  "difference 
quotients"  m(^)  and  u  (g)  are  defined  by 


whether  or  not  these  are  finite.  If,  for  instance,  S  is  a  real 
interval  (with  d(s,  s')  -  |s  -  s'|)  and  v>  is  differentiable  through¬ 
out  S,  m(V»)  is  the  supremum  of  |f<' (s)|.  The  hypothesis 

W  *(P#>  <  -  «  * 
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if  a  rare  regularity  condition.  The  next  two  assusqptions#  however, 
are  genuinely  restrictive! 


B7  p(f  )  *  1  tSL JL  ill  eel,  and 


h8  isim>  <  &  Haxi  It  a  b9»1Uyi  laUgn  *  nul  ibkxa.  ut  * 
mall  •  •  •  >  ek  lush.  itel 


where 


Mf  »  )  <  1  lad.  4>  .  €•>  >  o 

V  x  V- % 


?_  _  (a)  -  P  (*,  -  e.,  1  a  j  a  n). 

*1*  *  *  n  ■  J  3 


(1.4). 


Zn  H8  it  if  understood  that  the  integers  and  events  associated 
with  different  states  nay  be  different. 

The  inequality 


d(g(s),  g(s*))  a  u(g)d(s#  s') 


(1.5) 


for  Mappings  g  of  8  into  8  suggests  that  such  a  function  he  celled 

aitfnci  fllaLnlghlna  if  m-  (qr)  <  1  and  asxXs&x  fllittnca  dlalnlihlni 

if  ti(g)  <  1.  Hypothesis  H7  then  says  that  all  event  aerators  ere 
diatence  diminishing,  while  H8  says  chat#  whatever  the  present 
state#  sons  finite  sequence  of  events  with  strictly  distance  dish* 
inlshing  cumulative  effect  can  occur  on  subeequent  trials,  loth 
■7  and  H8  (with  k  •  1  for  all  states)#  ars  satisfied#  for  exasple# 
if  all  event  operators  are  strictly  distance  diminishing. 


Moraan 


8 


Zt  is  now  possible  to  introduce  the  following  preciiw  end 
convenient  terainology. 

Definition  1.1.  A  svstaa  ((8,  A),  1.  t.  e)  of  Mt»  and  fjnsvione 
jy  £  distance  diminishing  Model  (or  sieplv  £  Model)  if  f.  (•)  saps 
■  x  8  into  s,  e.  (•)  hop*  >  x  8  into  tig  non-negative  real  aaafet£&* 
2ec*  fe(#*  *  *•  IS482*  84  *  H5»  86  *  87  iOd  88  1SS.  satisfied. 

Definition  1.2.  stochastic  processes  tsj  and  (br)  laths. 
spaces  s  and  B,  respectively,  are  associated  •  ith  the  aodel  if  they 
satisfy  HI  and  H3. 
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2.  SURVEY  OF  RESULTS 

float  remarks  on  earlier  work  by  other  author*  will  be  deferred 
until  section  4. 

a.  Theorem*  concerning  state* 

The  process  [s^j  associated  with  any  distance  diminishing 
model  is  a  Harkov  process  with  stationary  transition  probabilities 
given  by 

K(s,  A'  *  ^  0e(»)  *  Ps*S2  *  **  (2.1) 

6:  f  (s)cA 


for  (Borel)  subsets  A  of  s.  The  n  step  transition  probabilities 
for  the  process  are  given  by 


K 


(n) 


(S.  A)  « 


Vs„.i  •  »>  ,2-2> 


for  n  *  1.  It  is  convenient  to  let  K^(s,  A)  be  1  if  a  <  A  and  0  otherwise. 
Functions  like  k  and  K^,  probability  nsasurss  in  their  second 

variable  for  each  value  of  their  first,  and  aeasurable  in  their 

first  variable  for  each  value  of  their  eecon<\  will  be 

called  stochastic  ks rne 1 ~ . 

A  basic  problem  is  the  asyep^totic  behavior  of  (a,  •)  as 


n  -»  ■*. 


Before  considering  this  question,  it  is  necesssry  to 


10, 


•pacify  v  *at  is  Mint  by  ‘convergence  *  of  •  sequence  iu  )  of 

IS 

-obabil ' tv  aaaaurai  on  H  to  a  probability  aaasura  u  on  a.  Tha 
jprupr  At  a  notion  as  'Jus  t  ur  converges  to  n  if  for  any  torsi 
reboot  A  of  8 

ji(A)  8  lie  inf _ u_  (A)  and  lie  aop_  <  u(A) 

n  ■■  n  vs 

•  « 

where  A  is  tha  interior  and  A  is  tha  closure  of  A.  if,  for  instance* 

•  is  a  real  interval,  suen  convergence  ia  equivalent  to  convergence 
of  distribution  functions  at  all  points  t*  continuity  of  tha  lisdt  — 
tbs  usual  notion  of  convergence  for  distribution  functions.  Tha 
extension  of  this  notion  to  stochastic  kerns Is  that  will  ba  used 
below  is  as  follows. 

BiCtolUoa  *•*.  1  nxmaei  (*n)  si  itwhuUs  lamU  eon- 
vurgas  uniformly  Jml  A  itWidltlC  kSXBtl  *m  it,  tSk  kSBL  fifiOl  IBfeMl 

*  Si  •  1B&  HU1  *  >  0,  there  &.  fH  latggqy  8  yai 

1  (•»  A)  -  f  S  K  {•*  A)  8  X  (s,  1)  ♦  « 

•  n  • 

for  all  aid  qnd  a  la  8. 

If  a  Uniting  stochastic  kernel  K^is,  A)  is  independent  of  s  for 
■11  A*  it  is  eoontiaes  natural  to  writs  X^CA)  instead  of  K^(s,  A). 
Aside  free  this  sbtge  of  notation  definition  2. 1  is  usaf factad. 

A  closely  related  problem  is  the  asyaptotic  behavior  of 
runotions  (sonants*  for  instance)  where  f  is  s  real 

veined  function  on  8.  two  notions  of  dcnwrgsnos  for  ssgseaoae  of 
real  veined  functions  on  s  ere  important  in  «bet  follows,  for  soy 
such  function  '  i>l  end  |yl  by 


lorau  i£. 


lr  1  ■  sup|7tt(.e^U. 

(2.2) 

set 

l7l  -  |7|  ♦  *(7>* 

(2.4i 

The  class  of  continuous  reel  valued  functions  on  •  ia  denoted  C(S) 

(not*  that  |r I  <  •  if  7  c  C(S)),  and  th*  aubclasa  on  which  *(7)  <  • 

(and  thus  S78  <  •)  ia  denoted  CL.  A  sequence  (7  )  of  functions  In 

C (8)  converges  uniformly  to  7  c  C(S)  if  |7ft  *  7|  -•  0  as  a  •*  •.  A 

stronger  notion  of  convergence,  applicable  to  functions  in  CL*  is 

l?n  "  7l  ■*  0  11  n  -•  ».  If  S  is  a  real  Interval  than  the  collection 

D  of  functions  with  a  bounded  derivative  is  a  closed  subset  of  CL 

in  the  sense  that,  if  «  D,  7  «  CL,  and  li^  |g  |7#  -  y|  -  0,  then 

7  c  O.  Since  If  I  *  |7|  +  |**|  for  any  f  <  D,  it  follows  that 
and  I7'  -7*|  -•  0 

|7#  “  7|  -*  0^»s  n  -•  •.  If  f#  and  •  0  fo r  all  e  c  S  and  if 

y  <  0,  then  I  (*(*)]  <  D  for  all  nil.  Thus  these  observations 
*  n 

are  applicable  to  7n<->  -  and  7B(«>  •  1. 

Theorem  1.1  gives  some  information  about  tbs  asymptotic 

behavior  of  (s  )  for  distance  diminishing  models  with  no  further 
n 

aeeusytione. 

jBsscs*  2-1'  isi  jsoot  thing  wtah  tia. 

tttaHiUg  *nrai  u/v>) K(j)  cgQYiraii  milmA y  u» 

tSL  a  A&flShllUS.  temi  *  »  for  any  horel  subset  A  of  1. 

Om  CQCltfcDt 

K  (•,  A)  «  CL.  There  is  f^C  <  •  such 

1*. 


(2.  S) 
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for  til  n  *  1  and  f  s  CL,  where 

B#[y{Sj]  -  *"(•#  da*).  (2.6) 

The  notation  g  [^<s  ) 1  for  the  expectation  of  f  with  reapect 
#00 

to  the  asymptotic  distribution  K  (s,  •)  is  not  meant  to  suggest 

that  there  is  a  random  variable  S  to  which  S  converges  with 

oo  n 

probability  1.  Though  such  convergence  occurs,  for  example,  under 
the  hypotheses  of  Theorem  2.3,  it  does  not  occur  in  general. 

Two  situations  will  now  be  discussed  in  which  the  conclusions 
of  Theorem  2.1  can  be  substantially  strengthened.  The  first  is 
characterized  by  the  loss,  asymptotically,  of  all  information  about 
the  initial  state;  the  second,  by  the  convergence  of  Sn  to  absorb¬ 
ing  states  with  probability  1.  Both  occur  frequently  in  mathematical 
learning  theory.  To  describe  hypotheses  that  lead  to  these  situa¬ 
tions,  it  is  convenient  to  have  a  notation  for  the  set  of  values  ' 
that  S  ,  takes  on  with  positive  probability  when  s.  *  s.  This 

set  is  denoted  T  (s) : 

n 

Tn(s)  *  {s'*  K(n)(s,  (s'))  >  0).  (2.7) 

An  absorbing  state  is,  of  course,  one  that,  once  entered,  cannot 
be  left;  that  is,  K(s,  {a})  «  1  or  T^a)  »  (s).  Another  convenient 
notation  is  d(A,  B)  for  the  (minimum)  distance  between  two  subsets 


A  and  B  of  Si 
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inf  d(a,  a*) 
sca ,  a'eB 


(2.8) 


If  B  is  the  unit  set  (b),  then  d(A,  B)  is  written  d{A,  b) . 

Theorem  2.2  shows  that,  to  obtain  aaynptotic  independence  of 
the  initial  state,  it  suffices  to  assume  that 


H9 


lim  d(T _(»)>  T  (s*)) 
n  n 

n~#» 


0  for  all  s,  s'  e  s. 


Theorem  2.3  shows  that,  to  obtain  convergence  to  absorbing  states, 
it  suffices  to  assume  that: 

H10  There  are  a  finite  number  of  absorbing  states  a^,  ...,  aR, 
such  that,  for  any  b  e  s,  there  is  some  a^  ^  for  which 

lim  d(T  (s),  a  . )  «  0. 
n-*»  3  1  1 

It  is  easy  to  see  that  H9  andHIOare  inconsistent  except  when  there 
is  exactly  one  absorbing  state,  in  which  case  they  are  equivalent. 

Theorem  2.2.  If.  a  distance  diminishing  model  satisfies  H9, 
then  the  asymptotic  distribution  X^s,  •)  *  k*°(» )  does  not  depend 

(n\  oo 

on  the  Initial  state  s,  and  K  converges  uniformly  to  x  .  There 
are  constants  a  <  1  and  C  <  «  such  that 

llB.U<sn)]  -B[lfr(Sw)]||  *  can||fj| 
for  nil  and  i  e  CL,  where  bU(S  )1  »  /_*( s)x"(ds). 

1  1,1  ®  S 


(2.9) 


J 


Norton  14.  y 

Theorem  2.3.  if  i  fliiamsi  diminishing  model  satisfies  H10, 

than  Sha.  stochastic  BSSfillft  CSr)  converts  with  probability  1  to 

&  random  absorbing  jrfeaJa.  Sn.  £or  any.  Him,  the  function 

if  b. ,  ,  b  juc ft  xmlL  juiabaxa,  „ 

*s(s«  *  mi)  fegAsasa.  to  c£.^ml  ftmatlaa  r(*)  -  lailm 

of  aquation  BgO  (S2)  ]  ■=  y  (a)  that  has  the  boundary 
value  a  7  (a  ^ )  ■  ly  The  stochastic  kernels  converge  uni¬ 

formly  to  k",  and  k“(s,  •)  assigns  weight  ^(s)  to  a^,  so  that 

«.t*(S.)l  -  I?.,  ?,<•  )Tp(a.) .  There  are  a  <  1  and  C  <  *  such  that 
■  ®>  1*1  ii  " 

||B  C^(8  )  ]  -  B  C^(S  >111  S  can\\f\\  (2.10) 

.  n  ■  ® 

for  all  nil  and  y  €  CL. 

These  theorems  suggest  the  following  terminology! 

Definition  2.2.  A  distance  diminishing  model  is  ergodic  if 
it  satisfies  H9 ,  and  absorbing  if.  it  satisfies  H10. 

Note  that,  whereas  in  Theorem  2.1  only  the  convergence  of 
Cesaro  averages  is  asserted,  in  Theorem? 2. 2  and  2.3  the  sequences 
(K^J  and  {B.[iMSn)1}  themselves  converge.  It  is  also  worth 
pointing  out  that,  although  it  is  of  little  importance  that  (2.9) 
and  (2.10)  imply  HbJ^MS  )  ]  -  E  [*'(S  )]||  -»  0  instead  of  simply 

*  n  «oo 

|B.i>(Sn)]  -B<[iMSeo)]|  -♦  0,  it  is  of  considerable  importance  that 
these  formulas  give  a  geometric  rate  of  convergence,  independent 
of  1/  as  long  as  |j^j|  is  less  than  some  fixed  constant. 

Proof i  of  Theorems  2.1  -  2.3  are  given  in  Section  5.  The 
main  tool  used  is  the  uniform  ergodic  theorem  of  I one sc u  Tulcea 
and  Marinescu  (1950) .  The  results  given  above  do  not  exhaust 


7  A*)  -  J 

SfiOUfiBfiUft 

solution 
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the  implications  of  this  theorem,  even  for  distance  diminishing 
models,  as  will  ba  saan  in  saetion  5. 

b.  Thaoraaa  concerning  B vents 

considar  sons  charactaristic  c4  that  partains  to  J  consecutive 

avanta,  i  a  1;  e.g.,  "response  R  occurs  on  trial  nM  (i  »  1),  "the 

rasponsas  on  trials  n  and  n+1  differ*  (1*2),  or  t>utcoae  0  occurs 

on  trial  n  and  response  r  on  trial  n+1"  (i  -  2).  It  is  often  of 

interest  to  know  tha  asymptotic  behavior  of  the  probability  that 

(B  ,  . ..,  B  ,  has  tha  property  C4.  Let  B4  ba  tha  aat  of  n 
n  n+i-i 

tuples  of  events,  and  let  A4  ba  tha  subnet  of  B4  that  corresponda 
to  c4;  that  is,  A1  »  ( ...,  e^t  (e^,  ...,  e^)  has  tha  property 
C4}.  Then  it  is  tha  asymptotic  behavior  of 

P*n)(A4)  -  P,(<ln,  Mn^x)  *  A1)  (2.11) 

that  is  in  question.  Theorem  2.4,  which  applies  to  both  ergodic 
and  absorbirg  models,  gives  much  information. 

ThBOIlff  2.4.  xai  JOY  trarttg  SI  IteiOrt^ng  *e*S±  there  is  an 
L  <  •  such  that,  for  anv  is  1  and  A4  C  b4, 

»P.(n)(A4)  -  p7(A4)B  «  La"  (2*12) 

£££.  ill  n  »1.  vAtltt 


P^  (A4)  -  /8  P4**  (AJ)K*(s,  ds‘), 


(2.13) 


Moreen  16. 

M4  ®  iM.  MM.  iSL  (2.9)  and  (2.10). 

pp  # 

In  the  argodic  case  the  subscript  i  on  f#(A  )  can,  of  course, 
be  dropped. 

The  following  corollary  for  absorbing  models  is  vary  useful. 

Corollary  2.5.  If  an  absorbing  and  »n  a1  C  sJ  have  the 

(1)  i 

property  that  P  (A  )  -  0  for  i  ■  1,  ...,  N,  then  the  total  nmrih^r 

*  | 
x  at  intifltri  "  I2L  nhish.  (*n,  •••,  zn+j.^  «  A  is  finite 

with  probability  1,  and 

||S.[X]||  S  Lo/(l  -  a).  (2.14) 

DM.  function  x<»>  -  «sfxl  ix.  £2*  unique  continuous  solution  g£  Dm. 
equation 

x(s)  -  p*1^1)  +  *#U(s2)l 

for  which  X(*i)  »  0,  i  «  1,  ...»  N. 

The  next  theorem  concerns  ergodic  models,  and  requires  some 

additional  notation  for  its  statement.  Let  h  be  a  real  valued 

function  on  B.  Then  the  asymptotic  expectations  of  h(Bn)  and 

h(l  )h(B  ..)  are  denoted  x(h(B  )]  and  B(h(B  )h(B  .)],  respectively, 
n  n+]  •  •  •t] 

Thus 


*[h(Bj]  -  I  h(e)P  ((e)),  and 

etS 


(2.15) 


«(h(*Jh(I  )]  -  I  h(e  )h(e.  )p"(((e  ,  ...,  «.  )}). 

•r**Vi  3 


(2.16) 
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In  typical  applicatlona  h  will  be  tha  indicator  function  of  aome 

A  C  E.  bo  that  h(B.)  ia  tha  nuofcer  of  occurrancaa  of  events 

jm  j 

in  A  during  the  block  of  n  triala  beginning  on  trial  a.  In  thia 
case , 

E[h  (B)]  =  p"(A), 

00 

E[h(E  )h(E  .)]  »  P°°(A  x  A),  and 

CO  ®°+l 

B[h(B  )h  (E  . )  ]  =  p"(A  x  JS*”1  x  A)  for  j  *  2. 

oo  oH  j| 

Theorem  2. SJbL  oraadifi  BOd  BOX.  £lll  valuod 

function  h  sa  B,  series 

E[h2(B  )]  -  E2[h(E  )]  +  2£  (BCh(B)hte  .)!  -  E2[h(B  >1)  (2.17) 

48  00  j-1  * 

2 

ssnxsxais.  ia  a  aafl-nsaitiya.  aantiaai  °h* 

(ii)  For  some  ®  iai  all  m,  n  *  1 

|i  -  nlChff.oV]  -  °2|  «  Chn"1/2.  <J.W> 

i  -  j»rri  J  '  -  1 

ggnmaiflUy*  On  law  Of  large  numbers 

li»  BJf  i  z  “  B[h(l  )])  1  -  0  (2.19) 

n—  j-m  3  *  '  J 

telfli  mlianaii  ia  »• 

(ill)  If  c2  ^  0.  the  central  limit  theorem 
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m*n~l 

,  2  h(l  )  -  n*[h(*J] 

ii-  » .(  i  J> .  7 -  <  *) 

n-«»  vn  o.  7 

n 

la  valid  for  all  a  e  8. 

A  distance  diminishing  modal  can  bo  regarded  as  an  example  of 
what  Iosifescu  (1963)  calls  a  homogeneous  random  system  with  com- 
Pll-tl  connections.  Theorem  2.6  ia  a  consequence  of  Theorem  2.4  and 
a  theorem  of  Iosifescu  on  such  ay sterna.  Results  in  this  subsection 
will  be  proved  in  Section  6. 


v* 


/; 


exp(-t  /2)  dt  (2.20) 
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3.  EXAMPLES 

The  example*  to  be  discussed  have  been  selected  so  as  to 
illustrate  a  variety  of  ramifications  of  the  theory  developed  in 
Sections  1  and  2. 

a.  LiM.it  Models  vyjth  Experiroentsr-Subiect  Controlled  Events 

Suppose  that  the  subject  in  a  learning  experiment  has  response 

alternatives  A^  ana  A^  on  each  trial;  and  that;  following  response 

A^(  one  of  two  observable  outcomes  and  0^2  occurs.  It  is 

assumed  that  0^  and  0^  positively  reinforce  A^,  in  the  weak 

sense  that  they  do  not  decrease  the  probability  of  A^.  The  outcome 

probabilities  are  supposed  to  depend  at  most  on  the  most  recent 

response.  Let  A  and  0. .  denote,  respectively,  the  occurrence 

of  A.  and  0  on  trial  n,  and  denote  the  prc&bility  P (O  |A.  ) 

x  ij  i]#n  *#» 

by 

Linear  models  with  experirasnter-subject  controlled  events 

(Bush  and  Mosteller  (1955))  for  this  situation  can  be  described 

within  the  framework  of  Section  1  by  identifying  p  ,  the  (condl- 

n 

tional)  probability  of  A  ,  with  the  state  S  ,  by  identifying 

A|M  n 

the  rs sponse -outcome  pair  that  occurs  on  trial  n  with  the  event  I  , 

n 

and  by  making  the  following  stipulations! 


s  -  Co,  ii,  d(p,  p* )  »  |p  -  p' I, 


(3.1) 


U»  j)  -  (AA,  otj)  end  *  -  C  (i,  j)»  1  •  i,  j  *  2), 


0.2) 


4 


/ 

■o  rnan  30* 

ftj(p)  -  U  -  eAj>p  +  0.3) 

fAj(p)  -  <PCA1  +  U  “  P)Bi2)Tij*  (3.4) 

Wil  +  Ti2  "  A*  •H-  0  *  ®Aj*  *  1  for  1  *  i,  j  *  2.  (3.5) 

In  (3.3)  and  (3.4),  «j.  is  tha  Kronacker  6. 

For  convnnlsnca,  any  systan  ( (S,  d),  I,  f#  a)  of  aats  and  functions 
satisfying  (3.1)  -  (3.5)  will  bn  rafarfad  to  as  a  four -ocara tor 
nodal.  In  this  terminology,  (3.1)  -  (3.5)  dafina  a  six-paranstar 
family  of  four-oparator  nodal a,  ona  for  aach  choica  of  0^,  0^, 

021'  6  22*  ^11*  *22  con-i,t*nt  with  (3.3).  Sinoa  nta^)  * 

and  ud^)  ■  (1  “  ®Aj)  *  1*  it  is  claar  that  any  four-oparator 
nodal  satiafias  all  of  tha  conditions  of  Oaf in it ion  1.1  axcapt 
parhaps  *• 

Tha  asymptotic  bahavior  of  tha  procass  (pR)  aaaociatad  with, 
a  four-oparator  nodal  da panda  critically  on  tha  nuabar  of  absorbing 
statss.  Leans  3.1  cataloguas  tha  absorbing  atataa  for  a  four- 
Oparator  nodal. 

IMM  3.1.  Tha  a  tats  1  X*  absorbing  if  and  only  if  »  0 

at  #ia  ••  o.  a a.  ttMStL  SL  it  itemtolM  it  aaA  aoix  it  *n  -  °  at 
®21  -  o.  *  p  c  (o,  i)  i«.  ihtttbiM  11  art  anlx  U  Aut  tart 
U#  )>  ‘  ^  *  J  at  -  0.  Itt.  Udi  filflp  111  atataa  m 

absorbing,  and  ]Jn  nodal  is  said  to  ha  trivial. 


Xorman 


n. 


Proof.  A  state  p  c  (0,  1}  la  absorbing  if  and  only  if,  for 
any  (i,  j)  c  S,  slthsr  f ^ (p)  -  p  (in  which  caaa  0^  •  0  sad 


f4 j  (x)  ■  x)  or  e^(p)  ■  0  (in  which  csss  r^ 


0  and  g  ^  (x)  »0). 


Tha  atata  1  is  absorbing  if  and  only  if  1  -  »  f^U)  •  1 

or  »12  -  o12(l)  •  0.  Tha  assartion  conoaming  tha  stats  0  is 
proved  similarly.  Q.I.D. 

Tha  naxt  lamina  tails  which  four-operator  aodals  satisfy  B8. 
hawM  3.2.  &  xasxresu&Sai.  jaadti  JLjt  diiunct  fllainltfilmr  il 


BOA  ftfllV  ii,  Xttt  ftASUl  i  «  (1#  2}*  thara  it 
JL&B&  6.,  >  o  AQl  >  o. 


«  Cl,  2) 


1J, 


Proof,  tuppoaa  that  tha  condition  given  by  tha  loan*  is  aat. 
If  p  >  0  than  (p)  •  prt4  >  0  and  n(fl4  )  -  1  -  #,4  <  1. 


lj 


Hi 


Hi 


Similarly,  if  p  <  1  than  (p)  >  0  and  ti(fM  )  <  1*  Than  M  is 


2j 


*1 


satiafiad  with  k  •  1  for  all  statas. 

Sappoaa  that  tha  condition  falls.  Than  for 


i  «  Cl,  2) 


and  all  j  «  Cl,  a),  6^  •  0.  or  r ^  •  0.  Slnoa  tbs  eaaas  i  •  1 
and  1*2  can  be  treated  similarly,  only  i  •  l  will  ha  oonaldarod. 
It  follows  from  Lamms  3.1  on  taking  J  •  a  that  1  is  an  absorbing 
state,  thus  f  _  (1)  >  0  inpliss  m.  •  1  and  r.  >  0, 

Vr  •••*  Vk  1  U1 


1  f  i  l  k.  lot  than  • 


C- 


0  for  Hill  and  u  (f 


Vk*1"**  \% 

o.t.o. 


>  •  i. 


is  not  satisfied 


•Sorman 


22 


Claarly  a  distanca  diminishing  four-oparator  modal  is  non-trivial. 

With  om  lnconsaquantial  sxcsptioc,  distanca  diminishing 
four  opsrstor  mods la  srs  sithsr  srgodic  or  absorbing.  Thsorsm* 

3.1  and  3.2  show  slightly  mors. 

SaSOa  3.1.  U  oslthsr  0  qsl  1  it  ISJL  * 

gaoto  adti*  Oak  >  o  tt&  rAj  >  c  isi  a  *  3.  ssfi.  m&l 

it  distinct  flflrfildfMffl-  titbit  (i)  -  1  an£  r  ■  1  it  i  /  j* 

si  Oa  *4tl  it  sise*i&- 

x&tssat  3.2.  u  t  SiiitRgt  tiateia&ai  aflti 

Mi  «a  afetprfeins  Om  it  it  m  jMs&tet  sdtl- 

Tbs  bshavicr  of  ths  p rocs os  (p J  whsn  6.  •  1  and  r  ■  1  for 

»  ‘3  ij 

i  ¥  j  is  eoaplataly  transpsrsnt.  starting  at  p  ths  proossa  movaa 
on  its  first  stap  to  1  with  probability  1  -  p  ami  to  0  with  prob¬ 
ability  p,  and  tharsaftar  sltaraatas  batwasn  thasa  two  art ram* 
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state*.  This  cyclic  modal  is  of  no  psychological  interest  and 

will  be  discussed  no  further. 

Proof  of  Theorem  3.1.  By  Le  ms  3.1  if  neither  0  nor  1  is 

absorbing  then  ft. .  >0  and  w. .  >0  for  i  /  1,  and  the  model  is 
13  1} 

distance  diminishing  by  lerana  3.2. 

Suppose  <  l.  Then  by  considering  first  the  case  p  *  0, 

then  p  >  0  and  ft^2  *  1,  and  finally  p  >  0  and  0  <  1,  it  is 

seen  that  (1  -  ft.  )np  e  T  ,p)  for  all  n  «  1.  Thus  d  (?  (p) ,  T  (q) )  a 
i  2  n  n  n 

(1  *  e12)n  l|P  “  ql  "*  0  **  n  -*  <»,  and  the  model  is  ergodic  accord¬ 
ing  to  Definition  2.1.  By  symnetry  the  same  conclusion  obtains 
if  <  1.  Suppose  that  0^  2  <  1.  Then  (1  -  G12)np  C  Tn^  for 

all  p  >  0  and  n  5  1,  and  (:  -  ft,  _)n  1G„.  €  T  (0)  for  all  n  i  1, 

12  21  n 

Since  both  sequences  tend  to  0,  ergodicity  follows.  The  same  conclusion 
follows  by  symmetry  when  0^  <  1.  Thus  if  (i)  does  not  hold 
the  model  is  ergodic.  Q.B.D. 

Proof  of  Theorem  3.2.  The  condition  given  by  bemme  1.2  for 
v  (our-operator  model  to  be  distance  diminishing  allows  four  pos¬ 
sibilities.  These  ere  distinguished  by  the  values  of  3^,  i  ■  1#  2t 

A*  ix  -  1.  -  1;  S*  jA  -  2.  «  2;  C»  3X  *  l,  j2  •  2;  and 

Dt  j  *  2»  32  »  1.  Learns  3.1  shows  that  D  is  inconsistent  with 

the  existence  of  absorbing  states.  Thus  it  remains  to  show  that 

a  aodel  is  absorbing  under  a,  t,  or  C  if  there  ere  absorbing  states. 

Under  A,  I  -  (1  -  0,.)n(l  -  p)  «  T  (p)  for  ell  n  *  1  end 

21  n 

0  i  p  i  1,  so  d(Tn<p),  l)  »  (1  -  «2i)n  -«0i»n-*«.  This  iapliae 
that  0  is  not  an  absorbing  state.  By  assumption,  however,  there 
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is  at  least  on*  absorbing  stats ,  so  1  is  absorbing.  But  than 

iim  d (T  (p),  1)  -  0  for  all  0  *  p  i  1  implies  that  the  nodal  is 
n-#»  n 

absorbing.  By  uynnstry  tha  nodal  is  also  absorbing  undar  b. 

If  0  is  net.  absorbing  >  0  and  ©21  >  0  by  Lassos  3.1.  Thus, 

if  c  holds,  A  iocs  also,  and  tha  modal  is  absorbing,  if  c  holds, 

and  1  is  not  absorbing,  the  same  conclusion  follows  by  syunatry. 

Condition  C  implies  that  (l  -  9_.)np  e  T  (p)  for  p  <  1,  1  - 

zi  n 

(1  -  011)n(l  -  p)  €  Tn(p)  for  p  >  0,  and  0^,  ©22  >  0.  Thus  if 

both  0  and  1  are  absorbing,  HID  is  satisfied  with  j (1)  •*  1,  j (0)  *  0 

and  j (p)  *  1  or  0  for  0  <  p  <  1.  Q.E.D. 

As  a  consequence  of  Theorems  3.1  and  3.2  all  of  the  theorems 

of  Section  2  for  ergodic  models  are  valid  for  non-cyclic  four-opera 

models  without  absorbing  states,  and  all  theorems  of  section  2  for 

absorbing  models  are  valid  for  distance  diminishing  four-operator 

models  with  absorbing  states.  A  few  illustrative  specializations 

of  the  theorem: of  Section  2  to  the  case  at  hand  will  now  be  given. 

The  first  concerns  convergence  of  the  moments  B  [pV]  of  the  process 

p  n 

On1' 

Theorem  3.3.  For  any  non-cyclic  distance  diminishing  four- 
o para tor  model  there  are  constants  C  <  »  and  a  <  1  such  that 

ilB.Cp^]  “  B.[p*lll  *  c  <v  +  Da11  (3.6) 

for  all  raal  v  a  1  and  positive  integers  n.  The  function  B.[p*] 

lux.  a  feamflid  flailaUa.- 
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This  is  obtained  from  (2.9)  and  (2.10)  by  notin?  that  the 
function  ^(p)  *  pV  belongs  to  D  with  |^j  *  1  and  m(^)  *  |^!|  *  v, 
so  that  [|^||  *  v  +  1. 

If  0  is  the  only  absorbing  state  of  a  distance  diminishing 
four-operator  model.  Theorem  2.3  implies  that  lim  n_W0Pn  *  0  with 
probability  1,  whatever  the  value  of  p^.  It  is  conceivable,  how¬ 
ever,  that  the  convergence  is  sufficiently  slow  that  the  total 
number  X  of  A^  responses  is  infinite  with  positive  probability. 
Furthermore,  even  if  X  is  finite  with  probability  1,  it  might  have 
an  infinite  mean.  Similarly,  even  though  p^  converges  to  0  or  1 
in  the  case  of  two  absorbing  states,  a  priori,  considerations  do 
rot  rule  out  the  possibility  that  the  total  number  Y  of  alterna¬ 
tions  between  responses  is  infinite,  or  barring  that,  that  its 
mean  is  infinite.  Theorem  3.4  axcluues  these  possibilities. 

IheQXftffi  3.4.  ii  o  is.  gnjy.  afcgoj&jjig.  statg.  o £  &  distance 
4imi.ni,shiiig.  four -operator  model .  then  X,  tjjg.  total  nasfefi£  of 
responses^  is  finite  with  Probability  1,  and  lis  [x)H  <  «.  IX  both 
0  1  MS.  absorbing  states  o£  a  distance  diminishing  four -opera  tor 

SB&Ju  then  Y,  the.  £otal  number  Of  alternations  frftwegft  responses. 

is.  iinifcs  nistfc ability  i  aM.  i(E.  CyJ  y  <  ». 

Naturally  the  first  assertion  is  still  true  if  1  replaces  0 
as  the  only  absorbing  state  and  X  is  the  total  number  of  A^ 
responses. 
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Proof, 
defined  by 

B  -  {(1,  1),  (1,  2))  and  (3.7) 

D  -  { C (i,  j),  fk,  i))t  i  /  k). 

Than  B  e  B  if  and  only  if  A.  occurs  on  trial  n,  and  (B  ,  B  , )  £ 
n  1  n  n+x 

D  if  and  only  if  there  is  a  response  alternation  between  trials 

n  and  n+1.  since  (B)  *  0,  and,  if  both  0  and  1  are  absorbing, 

P ^ (D)  »  p^ (D)  *»  0,  the  conclusions  of  the  theorem  follow 

directly  from  Corollary  2.5.  Q.B.D. 

If  h  is  the  indicator  function  of  the  subset  B  of  B  given 

by  (3.7),  then  A  *  h(E  )  is  the  indicator  random  variable  of  A. 

n  n  ipH 

and  x®*"-1  a ,  is  the  frequency  of  A.  in  the  block  of  n  trials 
j-m  j  1 

beginning  on  trial  m.  Theorems  3.1  and  2.6  yield  a  law  of  large 

numbers  and,  perhaps,  a  central  limit  theorem  for  this  quantity 

for  any  non-cyclic  four-operator  model  with  no  absorbing  states. 

The  full  power  of  this  result  cones  into  play  when  the  quantities 

2 

e[a  ]  *  lim  P  (A.  )  and  a  can  be  computed  explicitly  in  terms 

00  n-#»  p  x,n  h 

of  the  parameters  of  the  model.  This  is  the  case,  for  instance, 
when  all  6^  are  equal. 

Theorem  3.5.  £  four-operator  model  with  0^  *  0  >  0  for  1  * 

i,  j  *  2,  and  tt >  0  for  i  /  j,  but  not  0  -  ir12  ■  ir  “ 
ergodic.  The  law  of  large  numbers 


26. 


Let  B  and  D  be  the  subsets  of  B  and  8  ,  respectively. 


1. 
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lira  8  [  ( (l/n)S  „  -  /)  ]  =  0 
n-*»  p  "»'n 


(3.8) 


and  central  limit  theorem 


lim  P 
n-*o 


k 


S  -  ni  s  irx  2 

- -'■')*  -  /  exp(-t/2)dt, 

7  — 


^  sTn 


<  * 


(3.9) 


hold.  Where 


1  =  +  V' 


(3.10) 


2  , 


|  n  .  TT  , - 2(l^  -_e) - ; 

ir2l  +  tt12  L  11  22  +  (2  -  9)  +  2(1  -  ^  -  ir^)  (1  -  9)J 


,  (3.11) 


and  s  is  the  total  number  of  A.  responses  in  the  n  trial  block 
-  m,n - 1  - - * - - 

beginning  on  trial  m. 

Outline  of  proof.  Ergodicity  follows  from  Theorem  3.. I,  so 
Theorem  2.6  is  applicable  straightforward  computation  yields 


E(pJ  »  E[A  J  «  l, 


(3.12) 


E[pf]  -  l2  +  Bf(  1  -  l)/{(2  -  0)  +  2(1  -  *u  -  ^22)(1  -  «)),  (3.13) 


afA»Vl1  =  (1  ‘  G,8[pJ  +  and 


(3.14) 


'tVWk1  -  -  <l  -  9<\l  +  V)’‘"1(,!tVlA.1  -  <3-15' 
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2  2 

for  1c  l  1.  These  formula*  pannit  computation  of  a  ■  c^,  the  aariaa 
in  (2.17) .  Th*  result, racordad  in  (3.11),  ia  positive,  tinea 
9  <  t  <  1,  and  aithar  +  T22  ^  0  or  >  °*  Q.B.D. 

The  aquality 

M(p)  -  p  *  (v12  +  ir2i)  (f  -  p), 

where 


M(p) 


PfO..  or  0,.  \pn 
11, n  21, n  n 


P>. 


shows  that  the  asymptote  £  of  A^  response  probability  for  the 
lines'-  mo  lot  with  equal  9*  s  is  associated  with  Che  asymptotic 
equality  of  the  probability  of  and  the  probability  of  reinforce¬ 
ment  of  AjJ 


lim  P(0 
n-#» 


11, n 


or  °21,n> 


lim  P(A 
n-e> 


l.n 


). 


Such  probability  matching  is  a  well  known  prediction  of  the  linear 
nodal  with  equal  9' s.  Theorem  3.5  contains  a  much  stronger  predic¬ 
tion.  The  law  of  large  numbers  (3.8)  asserts  that  the  proportion 

(1/h)  3  of  A.  responses  for  a  single  subject  in  a  long  block  of 
in  j  n  1  *** 

trials  is  close  to  i  with  high  probability.  The  terms  ■'close*  and 

"high*  are  further  quantified  by  the  central  limit  theorem  (3.9). 

To  illustrate,  if  reinforcement  is  noncontingent  with  ir^  »  *  .75 

2 

and  9  is  small  (that  is,  learning  is  slow),  then  a  A  2tt (1  -  it)  - 
.375  so  that,  in  a  block  of  400  trials  commencing  on  trial  100,  the 
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probability  is  approximately  .01  that  (1/400)S10(J  40Q  will  depart 
from  .75  by  as  much  as  (2.  ri8)  (,612)/20  =  .079. 

There  is  one  modification  of  the  four-operator  model  examples 
of  which  have  occurred  sufficiently  frequently  in  the  literature 
(see  Estes  and  Suppes  (l95y),  Norman  (1964),  and  Yellott  (1965)) 
to  warrant  comment  here,  rf,  following  any  of  the  outcomes  O 

ij 

conditioning  is  a.'ss’.nel  t  >  be  effective  (i.e.,  p  *  f.  (d  )) 

n+1  ii  ' 


with  probiioility  ■'  ar.; 

O 


otherwise,  ineffective  (i.e.,  p  ,  «  p  ) 

*n+i 


3  five -operator  model  is  obtained.  It  is  easy  to  amend 

(3.2)  -  (3.5)  to  obtah  a  formal  description  within  the  framework 
of  Section  1.  such  an  addition  of  an  identity  operator  does  not 
affect  the  valid- *•-_  of  any  of  the  results  preceding  Theorem  3.5 
(or  their  proofs)  provided  that  *  j.s  everywhere  replaced  by 

The  first  sentence  o *'  the  amended  Theorem  3.5  should  read; 


A  five -operator  mouel  with  =  0  >  0  and 
j  s  2 ,  and  ?r^  >  0  for  i  j/  j ,  but  not  6  =  c 

ergodic .  Also  (3.il)  should  be  replaced  by 


c  >  0  for  1  s  i. 


*12  *  *21  ”  it 


2  _  ui  -  ti  r 


a  = 


*21  +  * 


12  *■  11 


+  r  + - UL^SSl - 

22  c  (  (2  -  0)  +  2(1  -  ttxi  -  tt22)(1  -  8)) 


]• 


(3. 


and  8  should  be  repiiiced  by  cp  in  (3.14)  and  (3.15).  An  interesting 

2 

implication  of  (3.16)  is  that  <  •*,  whereas,  if  c8  <  1, 

2 

lin^o  *  "•  Thus  the  variance  of  the  total  nusfcer  of  A^  responses 
in  a  long  block  of  trials  nay  be  useful  in  deciding  whether  a  given 
instance  of  "slew  leaminrr"  is  due  to  small  9  or  small  c. 
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b.  Low  joy  '  s  &3Sk  I 

Low joy's  (1966)  Nodsl  X  is  a  sisple  nods I  for  simultaneous 

dl scrladnat Ion  looming*  Lot  ths  rolovant  stimulus  dimension 

bo  brightness,  and  lot  white  (w)  bo  positiw  and  black  (B)  bo 

negatiw.  On  each  trial  tho  subject  is  supposed  either  to  attend 

to  brightness  (A)  or  not  (A),  which  ewnts  haw  probabilities  Pr(A) 

and  1  -  p  (A),  oiwn  A  the  probability  o l  the  response  appropriate 
n 

to  white  is  ?n(w|A),  while  giwi  A  the  probability  of  this  response 

is  1/2.  The  subject's  state  on  trial  n  is  then  described  by  the 

wetor  (p  (A) ,  p  (w|a))j  and  the  state  space  is 
n  n 

8  »  ( (p,  p')«  0  «  p,  p*  *  1). 

This  is  a  compact  metric  space  with  respect  to  the  ordinary 
Buclidean  metric  d. 


The  ewnts  are  the  elements  of 

B  -  ((A,  W),  (A,  B),  (A,  W),  (A,  B) ) , 
the  corresponding  transformations  are 

fAW(p*  p,)  "  <°lp  +  (i  ”  aih  °3P'  +  (1  ~  a3>>* 

fAB<P'  p,)  "  a4p*  +  (1  “  °4*>i 
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fXw*p'  P'J  "  {0ip*  p'*» 

fXBtP*  p,)  “  <a^  +  (1  “  a2*'  P*>' 
whara  0  <  a  ,  a^,  <  1,  and  thair  probabllitlas  ara 

p,)  "  PP'» 

fAB<P*  p,)  “  p(1  "  P*>» 

<P^W(P»  P* )  -  (1  “  p)/2,  and 

^(P.  P')  -  (1  ~  P)/2. 

Any  system  ((s,  d) ,  E,  f,  of  seta  and  functions  satisfying  the 
above  stipulations  will  be  called  a  discrimination  node!  of  type  I 
below. 

Theorem  3.6.  Any  discrimination  model  of  type  1  is  distance 
diminishing  and  absorbing  with  single  absorbing  State  (1,  1). 

Proof.  Axiom  H6  is  satisfied  because  of  the  continuous 
differentiability  of  the  g's,  and  H7  follows  from 

MfA|f)  •  maxfc^,  o3)  <1, 

Mf^)  -  max(a2,  o4)  <  1,  and 

U(V  -  V 


Norman 


32. 


.flue  it  remains  only  to  verify  H8  and  BIO. 

Bota  that#  aa  a  consequence  of  (1*5)#  for  any  mappings  f  and 
g  of  0  into  8  auch  that  }t  (f)  <  •  and  |i  (g)  <  «#  tha  ina quality 

n(f  •  g)  <  n(f)u(g) 

ohtaina#  where  f  •  g(a)  -  f(g(a))«  Tbia  implies  that  |i(f  )  < 

V*#th 

1  if  a^  ■  (A,  w) .  Now  <pAW  (p,  p* )  >  0  throughout 
8*  ■  C (pi  P')«  P  >  0,  p*  >  0), 


ao  to  complete  tha  varification  of  B8  it  aufficaa  to  ahow  that  if 
(p,  p')  c*  3'  tharo  ia  a  k  t  2  and  there  are  a van t a  e^#  ...#  a^_^ 
*  .  (P*  J')  «  »'  »»<*  m  (P»  P*)  >  0. 

V"Vi  V**Vi 


auch  that  f  (p,  y*)  <  s'  and  a  (p,  p*)  >  0.  If 

V"Vi  V**Vi 

0  <  p*  <1#  9^(0#  p')  >  0  and  fj^O,  p*)  <8*#  whila  if  0  <  p  a  1# 


Oj^B (Pt  0)  >  0  and  faR(p»  0)  c  s*.  FiMlly  fjn(f>#  0)nae  positive 


AB 


M' 


firat  and  null  aecond  coordinate#  so  0)  (  S'  aad 

^AB  (*Xb*0'  ^  binc*  *&«>,  0)  >  0  tha  lattar  inequality 

Implies  >  °* 

Tha  above  argument  shows  that  for  any  (p#  p*)  a  8  thara  ia 

wm§  '  -  v.  ... 

a  point  «  ^(p,  p')  ^  8*.  Since  maps  »'  into  8'  it 

*  {-)  #h 

follows  that^tn4k-l  (p#  p' )  for  n  »  0,  where  t™  is  tha  n  iterate 


Of  fjyj#  i.a.# 


No  man 


33 


^  "  *  *nd  fAW+1^  “  fxif  *  {K*  ‘  ^  4  °*  ®»t  for  any 

(q,  q')  c  8  and  n  *  0, 

fAW(q'  qt)  m  (1  *  (1  "  q)otJ'  1  - 

so 


d(Tn+X-l(p'  p,)*  (1'  1H  ‘d(fA 2)(,PP  )>  (1*  *>> 

*  <a^”  +  a^*) 1/^2  •*  0  aa  n 

Sinoa  (lf  1)  ia  obviously  an  absorbing  atsta,  tha  varification  of 

H10  ia  comp lata.  0.8. D. 

Hara  is  a  ssnpls  of  what  can  ba  concludad  about  Lova joy’s 

Nodal  Z  on  tha  baaia  of  Thsorcm  3.6,  Thaoraa  2.3,  and  Corollary  2.5. 

dltsrlalMUpn 

Thaoraa  3.7.  ypr  anv^ modal  of  tvaa  X,  lin^  (A)  -  1  and 
lian_^Pn (W|A)  -  1  with  probability  1.  Thara  ara  constants  C  <  • 
and  a  <  1  such  that 

j|t#[p''(A)p“(M|A)  J  -  lj|  *  C((v2  ♦  <o2)l/2  ♦  l)oB,  (3.17) 

n  n 

for  all  raal  v,  <u  a  1  AOd.  ooaitiva  intaonra  n.  Tha  total  BUMlUL 

2  fil  »  ai2gom  li  Ilftlti  ilfii  rrnfaihUUY  i  «&4  !*.£*] I  < 

If  ■  1  .q  and  ■  l  -  0* , 

*  „.(2)  •  (1  ’  P>/®  ♦  2U  -  P')/0'. 

P#P 


(3.18) 
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Proof.  The  first  statement  follows  dirsctly  froa  Thaoraa  2.3. 

Tha  sacond  follows  froa  (2.10)  on  taking  ^(p,  p')  -  pVp,<0  and 

2  2  1/2 

noting  that  m(*)  <  (v  +  a>  )  as  a  conasqusnca  of  tha  as  an  valua 

thaoraa  and  tha  Schwartz  inaqua lity.  Tha  third  stataasnt  follows 

froa  Corollary  2.5  on  taking  A^  -  {  (A,  B),  (X,  B) )  so  that 

P  ,(A^)  ■  p(l  -  p‘)  +  (1  -  p)/2.  Sines  tha  function  x(p,  p')  * 
P»P 

{1  -  p)/8  +  2(1  -  p*  )/8*  is  obviously  continuous  with  x(l»  1)  *  0, 

(3.18)  is  provsd  by  verifying  that  this  function  satisfies  the 

functional  aquation  given  in  tha  stataasnt  of  Corollary  2.5  when 

*  1  -  0  and  ■  1  -  0*.  Q.B.D. 

Of  the  two  learning  rata  paraoeters  appearing  in  (3.18),  8* 

is  associated  with  the  response  learning  process  (p^CwjA) j,  while 

9  is  associated  with  tha  perceptual  learning  process  (P  (A)). 

n 

Suppose  that  the  diaci'i«i«ation  problem  under  consideration  (with 

Pj (A)  «  p  and  P^(w|A)  -  p' )  has  been  preceded  by  j  trials  of  a 

previous  (re varied)  problem  with  black  the  positive  stimulus. 

Then  p  will  tend  to  increase  and  p*  to  decrease  as  j  incresses. 

Thus  overtraining  tends  to  decresse  (1  -  p)/8  and  to  increase 

2(1  -  p')/6*.  Which  effect  predominates  and  determine*  the  effect 

of  overtraining  on  B  , [z]  will  depend  on  the  magnitudes  of  6  and 

P>P 

9' ,  large  8  and  small  8*  leading  to  ar.  increase  in  errors  with 
overtraining,  and  email  6  and  large  9*  lead¬ 
ing  to  a  decrease  in  errors  with  overtraining  —  the  "Overlearning 
reversal  effect."  This  oversimplified  argument  ignores  the 
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•f fact  of  the  magnitude*  of  6  and  6*  on  p  and  p' ,  bat  it  nona  tha 

lass  suggests  tha  power  of  (3.18). 

In  concluding  this  aubsaction  it  ia  worth  remarking  that  tha 

theory  of  Section  2  ia  alao  applicable  to  Bush* a  (1965,  pp.  172-175) 

linear  operator  version  of  Wyckoff's  (1952)  discrimination  modal 

whan  P  •  1/2.  In  that  case,  (x  ,  y  ,  u  )  can  be  taken  to  be  tha 

n  n  n 

state  on  trial  n,  and  this  triple  determines  tha  error  probability 
on  trial  n.  Whan  there  are  only  two  learning  rata  parameters, 
e*  >  0  for  (x^)  and  (yn),  and  8  >  0  for  (un),  tha  expected  total 
errors  is  given  by 

S  I2j  «  (1  -  u)/e  ♦  2(1  -  x)/8*  +  2y/e\ 

x.y.u 
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4.  SONS  REMARKS  OH  PREVIOUS  WORK 

Theorem  2.2  includes  the  main  convergence  theorem  of  Onicescu 

and  Mi  hoc  (1935,  saction  5),  and  an  ergodic  thaoram  of  Iona sea 

Tulcaa  (1959,  Saction  8).  it  includes  many  of  Karlin* s  (1953) 

results,  and  has  points  of  contact  with  the  work  of  Lamport!  and 

Suppes  (1959)  and  losifescu  and  Theodorescu  (1965).  Rone  of  this 

previous  work  covers  the  general  non-cyclic  four-operator  (linear) 

model  without  absorbing  states.  Karlin's  results  are  concerned, 

for  the  most  part,  with  two-operator  models,  that  is,  four-operator 

models  with  0,^  «  9^  «  0^^  and  0^^  -  0^^  ■  0^.  The  main  ergodic 

theorem  of  losifescu  and  Theodorescu  (1965,  Theorem  2)  is  not 

applicable  to  any  four-operator  model,  since  one  of  its  assumptions 

is  that  there  is  soma  positive  integer  k,  positive  real  number  a, 

and  response  A.  ,  such  that  response  A.  has  probability  at  least 
X0  x0 

a  on  trial  k  +  1,  regardless  of  the  initial  probability  of  A. 

0 

and  the  responses  and  outcomes  on  trials  1  through  k.  Such  a 

hypothesis  would  be  more  appropriate  if  some  of-  the  operators  in 

(3.3)  had  fixed  points  other  than  0  or  1. 

The  method  of  Lamport i  and  Suppes  is  somewhat  different  from 

that  of  the  present  paper,  and  has  a  certain  shortcoming,  consider 

a  two-operator  model  with  9^  0^  >  0  and  0  <  ir^,  ir^  <  1.  Such 

a  model  satisfies  the  hypotheses  of  Lamport i  and  Suppes*  Theorem 

4.1  (with  nf  -  1,  k*  ■  1  or  2,  and  m.  ■  0)  if  their  event  f  ■  j  * 

v  n 

is  identified  with  *0. .  or  0o,  *  in  the  notation  of  subsection  3a. 

13, n  «]»** 
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Ona  of  tha  conclusions  of  that  thaoram  ia  that,  for  all  positive 

intagars  v,  cu  -  B  [pV]  con  verges,  as  n  -»  »,  to  a  quantity  a!  ■ 
i,n  p  n  #  l 

E[p^]  which  does  not  depend  on  tha  initial  A^  response  probability 

p.  The  a  notation  is  theirs.  This  conclusion  follows,  of  course, 

from  Theorem  2. 2  of  the  present  paper  (along  with  an  estimate  of 

the  rate  of  convergence  that  their  method  does  not  yield) .  But 

the  author  has  found  no  arguments  in  their  paper  that  bear  directly 

on  the  lack  of  dependence  of  the  limit  on  p.  (Their  notation, 

e.g.  a  ,  does  not  even  refer  to  p.)  The  only  kind  of  conclusion 
1  j  n 

that  can  be  drawn  from  the  arguments  given  by  Lamperti  and  Suppes 
is  that  (in  the  notation  of  the  present  paper),  for  any  p, 


EP^Pn+k^°i^j^,kAi^,l 


. . .  0  .  .  A ,  ,  j  ■  E, 

^l’1  V1  J 


'Vi’-"’1*?* 


(pl'Pn1 


converges  as  n  -»  ®  to  a  quantity  that  does  not  depend  on  k  or 

The  recent  corrections  (Lamperti  and  Suppes' (1965) ) 

i^,  j^,  •••»  This  is  not  quite  what  is  required.  ^ 

of  the  Lamperti  and  Suppes  paper  do  not  affect  this  observation.  The 
method  of  Lamperti  and  Suppes  is  an  extension  of  the  method  used 

by  Doeblin  and  Fortet  (1937)  to  study  what  they  call  chains s  (B) . 

It  appears  that  Doeblin  and  Fortet’ a  treatment  of  Onicescu  and 

Shoe's  1  JlflJjgai  completes  (chains s  (O  -  by  maans  of 

their  theory  of  chalnea  (*)  has  the  same  shortcoming. 

A  distance  diminishing  four -operator  model  with  two  absorbing 


ii*  0ii  ^  0  £or  *  "  2  •n<*  •ith*r 


states  necessarily  has  v 
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■  0  or  ■  0  for  i  ft  j.  Thus  it  has  two  affactiva  opa raters 
and,  perhaps,  an  identity  opa ra tor.  such  nodals  were  studiad  by 
Karlin  (1953) ,  and  tha  implications  of  Thaoram  2.3  for  thasa 
nodals  do  not  add  much  to  his  rasults.  Tha  ganarality  of  Thaoram 
2.3  is  roughly  conparahla  to  that  of  Kannady's  (1957)  theorems, 
though  Kannady's  assumptions  axcluda  Lova joy's  Modal  X. 

Tha  argodic  thaoram  of  I one sc u  Tulcea  and  Marina ecu  (1950) 
usad  in  tha  proof  of  Theorem*-  2.1  -  2.3  extends  earlier  work  by 
Doeblin  and  Fortet  (1937,  see  section  titled  "Rota  sur  une  tquation 
f  onctionnelle  ■) .  The  condition  H9  wa.»  used  by  Jamison  (1964) . 

Let  Y  be  tha  total  number  of  response  alternations  for  a 
distance  diminishing  four-operator  modal  with  two  absorbing 
barriers.  That  Y  is  finite  with  probability  1  (see 

Thaoram  3.4)  follows,  in  the  special  case  ir^  «  1,  8  -  0  >  0, 

1  i  i  a  2,  from  a  result  of  Rose  (19S4,  corollary  2  of  Theorem  5). 

Theorems  3  and  4  of  iosifeacu  and  Theodorescu  (1965)  give 
rasults  like  those  of  Thaoram  2.6  of  tha  present  paper  for  a  sub¬ 
class  of  the  class  of  models  to  which  their  Theorem  2  is  applicable. 
This  class  of  models  is  disjoint  from  the  class  of  four-operator 
models,  as  was  pointed  out  above.  However,  once  Theorem  2.4  has 
been  proved,  a  theorem  of  losifescu  (1963)  leads  to  Theorem  2.6. 

To  the  rasults  in  Theorem  3.5  and  its  five-operator  generalisation 


could  be  added 
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“  var  (s  )  -  a2  +  0(n’1//2) 
n  p  m,n 

which  also  follows  from  Theorem  2.6.  in  tha  apacial  caaa  of 

noncontingant  reinforcement  and  c  ■  1,  tha  result  is  a  consequence 

of  Theorem  8.10  of  Bstes  and  suppss  (1959).  A  similar  result  for 

lim  var  (s  )  when  reinforcement  is  noncontingent  and  0  <  c  <  1 
m-eo  p  m>n 

follows  from  formula  (2.16)  of  Yellott  (1965). 
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5.  PROOFS  OP  THBORSMSCONCZRffXNG  STATES 

a.  The  Basic  iroodlo  Theorem 

in  this  Motion  only,  c  (S)  is  the  Mt  of  complex  valued  con¬ 
tinuous  functions  on  8,  and  IMI,  and  CL  ars  redefined 

accordingly  (see  (1*2),  ( 3) ,  (2.4)  and  the  sentence  following 
(2.4)).  The  spaces  c(S)  and  CL  are  Banach  spaces  with  respect  to 
the  norms  |*j  and  ||*||  respectively*  The  space  CL  is  also  a  normed 
linear  space  with  respect  to  | * j .  The  norm  of  a  bounded  linear 
operator  on  c (S)  or  CL  is  denoted  in  the  sama  w«y  as  the  norm  of 
an  element  of  theM  spaces.  Thus  if  U  is  a  bounded  linear  operator 
on  C(S)  its  norm  is 

ju|  -  sup  | Up| , 
f  €  C(S)l 
1*1  *  1 

while  if  u  is  a  bounded  linear  operator  on  CL  its  norm  is 

Hull  -  sup  llu^ll. 
f  €  CLt 

m  *  i 

Finally,  if  u  is  an  operator  on  CL,  bounded  with  m spect  to  | * | , 
its  norm  is  denoted  |uL, ,  thus 

Cl* 

-  »«p  h*i. 

ft  CLt 
1*1  *  1 

If  u  is  a  linear  operator  on  a  linear  apace  W  over  the  complex 
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numbers,  and  If  X  la  a  complex  number,  D(X)  danotaa  tha  M\v  of 
all  x  c  If  auch  that  Ox  ■  Xx»  obvioualy  D(X)  la  a  linear  sub space 
of  If  and  always  contains  0.  If  D(X)  contains  an  element  x  /0, 

X  is  an  alcwinva2.ua  of  u. 

ona  cf  tha  ma  tha  mat  leal  comarstonaa  of  this  pc  par  is  tha 
following  lemma,  which  is  a  specialisation  of  a  uniform  ergodic 
theorem  of  Ionescu  Tulcoa  and  Marina ecu  (1950,  Section  9)  along 
lines  suggested  by  these  authors  (ionescu  Tulcea  and  Marlnescu, 
1950,  Section  10). 

Leimaa  5.1.  Let  U  be  a  linear  operator  on  CL  such  that 

(i)  l«lcl«i. 

(ii)  u  is  bounded  with  respect  to  the  norm  ||*|i>  and 

(111) for  soma  positive  integer  k  and  real  numbers  0  s  r  <  1  and  R  < 

m(U*  if)  i  rm,»  +  rM 
for  all  i>  c  CL.  Than 

(»)  there  are  at  most  £  finite  number  of  eigenvalues 
Xx,  X . . • ,  X^  of  u  for  which  | X ^ |  •  1 , 


(*»)  SSL  SU  PPf.*U**  j^tygyrs  n 
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2 2ttH  v  «s£  &L  oi  1£1  liDftUL  oporators  on  CL,  bounded  with 
respect  to  ||  •  II , 

(c)  of  »  ui#  -  0  for  i  ✓  j  uiv  “  vui  «  0, 

(d)  DfX^)  *  (CL)  la  finite  dimensional .  1-1,  ...,  p,  and 
(«)  for  aoaa  M  <  -  and  h  >  0, 

IIV^II  *  M/(l  +  h)h 
for  all  positive  integer*  n. 

This  lemma  will  ba  applied  to  tha  raatriction  to  Cl.  of  tha 
bounded  linear  operator 

U*<»)  -  8  C^(S_)  ]  -  S  f(t  (8))fe(a}  (5.1) 

acB 

on  C(S)  aaaociatad  with  any  distance  diminishing  nodal.  This 

operator  is  of  interest  because  its  -  ift  iterate,  applied  to 

a  function  f  c  C(S),  gives  the  expectation  of  ^(8  )  as  a  function 

n 

of  the  initial  state!  that  is, 

*  ClK8n)]  -  Un_1^(s),  (5.2) 

n  a  0.  This  formula  is  easily  proved  by  induction.  Xt  holds  by 
definition  for  n  -  1  and  n  -  2,  and,  if  it  holds  for  an  n  a  1, 


then 
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V*(Sn+l)]  - 

*  B>rur,'i^(s2)]  - 1 ftia). 

Theorem  5.1.  IM.  conclusiona  (a)  -  (•)  2 1  ItfJBB.  5.1  hold  for 
ShSL  9,RS ntgl  W(a)  ■  *8U(S2)  ]  aiiggjjjtid  aiiil  inn  dlitance  dimin¬ 
ishing  modal.  la  1341  Uqs,.  1  lisa  liaumiui  Si  u  an£  D(l)  contains 
all  constant  function a  on  3. 

Throughout  the  reat  of  this  paper  it  will  be  aasumad,  without 
loaa  of  generality,  that  *  i  and  A  ¥  1,  j  *  2,  . . . ,  p,  where 
the  A  Vs  are  the  eigenvalues  of  U  of  modulus  1. 

Proof.  The  last  statement  oZ  the  theorem  i.  obvious.  It  thus 
remains  only  to  verify  the  hypotheses  of  Lemma  5.1.  For  any  ^  e  CL 


UV (a)  -  U*(s' )  - 

*  Z  v  hit  (a))  -  i>(t  <a'))')  <9  (a)  +E*(f  (e’)Mo  (a)  -0  (s‘)), 

«  \  e  e  e  •  a  *  e 

so 

|U4(a)  -  U*(e»)|  * 

t  Z  j*(f  (a>>  -  •&(£  (s' ) )  !e  (a)  +  £  |*(f  (a*))||e  (•)  -  *  <8,)j 

•  e  e  e  e  a  a  e 


*  ®(*)  £d(f#(a),  f^(a’))e#{a)  +  |  F|  (I  m(e#)  )d  (a,  a*) 


i  aa  <^)  Z  e#(a)  ♦  |  *}\J£  »(e#)')]  d(a,  a*). 
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by  H7.  Thus  Ui>  €  CL  as  a  consaquancs  of  H6(  wd 


»(U^)  *  m(*)  +  |*|  2  m(<p^). 


(5.3) 


Clsarly 


|U*(a)|  *  2  j*(f  (•))!•  (s)  *  |*|, 


|U*|  *  |*|. 


Thsrsfors  (i)  is  satisfiad,  and 


M  *  n{*)  +  |*|  {1  +  I  ■  (•  )> 


*  [1  +  Z  IUL 


so  (ii)  is  satisfisd  also. 
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Hypothesis  (iii)  off  Leans  5.1  will  new  be  verified.  Let  k(e) 

be  the  integer  and  e„  ,  ...  e  .  ,  v  the  events  in  the  sta tenant 
*  e*l  s#kls) 


of  H8.  For  each  a  c  8  let 


Z<s)  ■  {t  «  84  <p 


®s,l****e,k(s) 


(t)  >  0}. 


Since 


?a  a  M  “’a  <t,9a  ^  • • •  f-  (*  «  Ct) ) 

®1*  *  **n  *1  *2  *1  *n  V**Vl 


is  continuous  for  any  events  e, ,  ...,  e  ,  B(s)  is  open.  Further- 

a  n 

« 

store  s  c  3{s).  Since  S  is  compact  the  open  covering  (Z(s)t 
a  c  S)  has  a  finite  subcovering  Z(Sj),  ... ,  Z(sR).  Let  k^  ■  k(a^) 
and  let  K  -  If  t  «  Z(Sj),  let  j  "  ^  j» 

J  -  1,  . k^.  Clearly  it  is  possible  to  choose  e^  ^  +1#  . .., 
e£  K  itybuch  a  way  that 


pa'  a* 

•t,l,##*t,K 


.  (t)  >  0. 


Hypothesis  H7  implies  u. (f  )  a  1  for  any  events  e , , 

V*j  1  3 

so  the  irnxiaHty 


Mf_.  )  <  1 

•t,l*‘**t,K 

ie  obtained  from  B8.  Thus  the  integer  K,  which  does  not  depend 

» 

on  t«  and  the  system  e'  . »  . ..,  e®  of  events  satisfy  H8.  There- 

x»*  *•#* 

fore  it  can  be  assumsd  without  loss  of  generality  that  the  integer 
k  in  88  does  not  depend  on  the  state. 
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If  *  «  CL  and  i;  «'  «  8  than 

Z  6>(f«  .<■»“♦( *  «  (*')))  \  .«•> 

u  \  v  *  '  V*% 

v% 

♦  Z  *<*«  .  <•*>>  (t*  .  <■)  .<••)> 

^  *l****lt  *l**,eit  .  • 

V"% 


Thar* fora 


[»<♦)  —  y*  y  (f  >t  (a)  ♦  |*l  Z  *<*.  ■*J* 

*1*  *  ‘*k  V#-“k 


■o*  CL  la  closed  under  multiplication  and  under  composition  with 
aapplnga  of  8  into  8  for  which  y  (f )  <  «.  Thus  a  «  Cfc  for 

V"*j 

any  events  a^  a^  and 


4  *  X  .  3 

3  -  .  V"*J 


<5.4) 


V"*J 
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It  follow*  that  for  a  /  a' 


w(tf)  I 


l 


V**V 


{uN'(a)  -  Uk^(8«)|/d(8,  *')  S 


+  X  1  n(f  a  )*a  a  (•)  +  I* 

al*  *  *eTc* 


H(f  _  )<1  U1  (f  .  >-l 

V*%  V**®k 


z  m  (<//)  I  k 


[  ‘I 


where 


0  -  •  •  •  fi.  s 
1  k 


2,  j  v.  ...<■>  +  I*K 

Ik 


I,  t  M  0,  • 

1  k 


n(f.  «  .  J*1 

1**  k  1**  k 


5  n»W  <kA  +  (1  -  AW  +  |'H\i 


max 

V“V 

K«a  a  J<1 
1  k 


M.Cf  _  )  <  i 

1**  k 


(5.-5) 


and 


nan 

S€d 


i 


VV 

n(f  .  )<l 

•1...#k 


(8)  >  0. 


(5.6) 


The  latter  inequality  ia  a  consequence  of  H8,  H5,  and  the  contin¬ 
uity  of  the  <p 


V"*k 


Therefore 
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m(uNr)  *  ra(^)  +  WjJfl  (5.7) 

whara  r  *  kd  +  (1  -  A)  <  1,  and  (iii)  of  Lemma  5.1  ia  aatiafiad.  Q.E.D. 

Though  it  conatitutaa  a  alight  digraaaion  and  will  not  ba  uaad 
in  the  sequel,  an  additional  consequence  of  (5.3)  and  (5.7)  ia 
worth  pointing  out.  From  these  inequalities 

mivPf)  *  m(ii)  +  nlV'jm^  and  (5.8) 

mtu*115^)  a  m(^)  +  |  *1^/(1  -  r),  (5,9) 

n  s  0,  are  easily  obtained  by  induction,  these  yield,  on  combination, 

m(Uj*)  s  m(  0)  +  |*|((k  -  l)m1  +  n^/d  -  r) )  *  (5.10) 

valid  for  all  he  CL  and  j  i  0.  It  follows  that  (uV)  is  equi- 

continuous.  This,  together  with  the  fact  that  CL  is  dense  in 

C(S)  (as  a  consequence  of  the  stone -Weierstrass  Theorem)  and 

|u|  ■  1,  implies  that,  for  any  ^  e  C(S),  (uV)  is  equicontinuous. 

1965) 

in  the  terminology  of  Jamison  (1964, ^the  operator  U  on  c(S) 
associated  with  any  distance  diminishing  model  is  uniformly  stable, 
in  the  terminology  of  Feller  (1966)  the  corresponding  stochastic 
kernel  X  is  regular. 

b.  proof  of  Theorem  2.1 

The  following  lemma  includes  most  of  the  assertions  of 


Theorem  2.1. 
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14WW  a.  <  yog  anv  dlatanco  dliilnlahlnq  th**»  la  • 

atochaatlc  Kp.r;jLk  k"  ££b  &£*  (2.5)  teldi>  whort  s#0(sj]  i£ 

qfvan  bv  (2.^ ..» . 

Proof  pj  Lmtmm  5.2.  Thaoram  5.1  inpliaa  that 

io^i  *  i  iio, ii  +  #v"ii  i  im.i 

i*l  1  i-1  1 

aa  n  •»«.  Tharafora  thara  ia  a  constant  w  <  •  auch  that 

Iio11 |j  <  W  (5.11) 

for  all  n  a  0. 

-  n”^  j 

Lot  u  -  UA)  2  ir.  Than,  by  Thaoram  5.1, 

"  j-0 

n  . 

5  -  (1/h) (I  -  o”)  +  (1/h)  2  U3 

j-1 

p  n  ,  \  n  , 

-  (1A)(I  ~  u  )  +  ( 1A )  Z  I  2  X3  j  u.  +  (1A)  2  v3. 

i-1  N  j-1  j-1 

Tharafora 

p  n  . 

U  -  0.  -  0A)(I  -  U°)  +  (1A)  2  X.[(l  -  x")/(l  -  X.))U.  ♦  (1A)  2  V3, 

n  1  i-2  1  1  1  1  j*l 

|6n  -  ^8  «  cA 


oo  that 
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whara 


P 

c  -  (l  +  w)  +  2  z  UuiH/| i  -  +  M/h . 

i«2  A 

Thus,  for  any  it  e  CL, 

Bun*  -  *  !!un  -  uxIM  *  cIM/h,  (5.12) 

and,  a  fortiori, 

lim^lu^  -  Ux*|  -  0  (5.13) 

for  all  f  e  CL. 

Sinca  CL  ia  danaa  in  C(8)  and  |Un|  *  1  for  n  *  1,  it  follows 
that  (5.13)  holds  for  all  it  c  C(S),  whara  has  baan  axtandad 
(uniquely)  to  a  bounded  linear  operator  on  C(S).  Since  the 
operators  on  c(S)  are  all  poaitive  and  present*  constants,  (5.13) 
implies  that  the  same  is  true  of  U^.  Thus,  for  sny  s  «  s,  0»^(a) 
is  a  poaitive  linear  functional  on  C(8)  with  ur(s)  ■  1  whore  T(s)  a 
1.  Hence,  by  the  Rieas  repreaentation  theorem,  there  is  a  (unique) 
Borel  probability  measure  K**(a,  • )  on  8  such  that 

)«"(•»  «Js')  (5.14) 

for  all  it  «  C(S).  In  view  of  (5.14),  (5.12)  reduoes  to  (2.5). 
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That  K*  is  •  stochastic  karnsl  follows  fron  tha  fact,  now 
to  ha  proved,  that  K*(° ,  A)  <  Cl.  for  ovary  Boral  subsat  A.  This 
is  obviously  truss  if  A  *  8.  Suppos j  that  A  is  an  opan  sat  such  ( 
that  its  costplaasnt  X  is  not  anpty.  For  j  *  1  dsfins  ^  t  CL  by 

(  1  if  d(s,  X)  •  1/j 

1L(«>-#  „  (5.15) 

V  \  jd(s,  X)  if  d (s,  3t)  *  1/j. 

* 

Than 


/I  if  s  c  A 

W»,  1.  (•)  -  „  -  V  (5.16) 

’T  1-*  3  lo  if  ■  t  *  * 


whara  is  tha  indicator  function  of  the  sat  A,  and  tha  convergence 
is  nonotonic.  Tharafora 


—  lin  U.i) .  (■)  -  /I.  (»•)%"(•,  ds*)  -  k"(s,  A)  (5.17) 

)-»  1  3  *  A 

for  all  s  «  8.  By  Theoxen  5.1,  0(1)  ■  ^  (CL)  is  a  finita  dimsn- 
sional  subspa ca  of  CL.  Banco  thara  axicta  a  constant  J  <  •  such 
that  Ifl  a  j|g|  for  all  f  c  0(1).  Tharafora 

•  .  '  '  .  '  ' 

IVj(*i*  “V^a^  «  *(u1ni)d(s1,  s2) 

i 

a  j|olnj|d(s1,  sa)  a  Jd(s1#  s2) 

for  all  j  a  1  and  s^,  »2  <  8.  Bquation  (5.17)  than  yialds,  on 
lotting  j  approach  •# 

|jt*(»2#  A)  -  JC^Sj,  A)  {  a  jd(s1#  s^. 
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3f  A  la  an  arbitrary  Boral  Mt,  i^,  a^  c  8,  and  «  >  0,  the 
§§ 

regularity  of  K  (s.,  •)  Insures  the  existence  of  an  open  aet  A. 

1  XjC 

ouch  that  A.  D  A  and 

k"*(b^V  A^  )  “  a)  i  < 

for  1  ■  1,  2.  Thus  A  ■  A.  r)  A  is  open  and 

€  lf€  «#€ 

0  a  K*#(a1,  Af)  -  x"(ai.  A)  t  «, 

1  ■  1,  2.  Corbinatlon  of  these  inequalities  with  the  result  of 
the  last  paragraph  yields 

|K*(.r  A)  -  k"(s2.  A)  I  s  Jd(a1#  a2)  ♦  2c, 

or,  since  c  is  arbitrary, 

A)  -  k"(o2.  A)  |  S  Jd(s1,  a2). 

Thus  K**(«,  A)  c  CL  with  m(x"(»,  A))  t  J  for  all  Borel  subsets 
A  of  8.  0.1.0. 

Actually,  this  proof  gives  (2.5)  for  the  complex  ae  well  as 
real  valued  functions  f ,  though  this  is  not  important  here. 

To  couplets  the  proof  of  Theorem  2.1  it  remains  only  to  prove 
lams.  s.3.  Ha.  ltaslmUa  Xirail,  (1A)I^“J  converges 
MaUffioaly  ia  k". 

Proof.  Denote  (l/n)!^”*  by  K#. 
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Since 

i  (»i  S)  i  K  (•,  o)  i  *w(s,  8)  (5.18) 

n  n  n 

it  suffices  to  show  that  if  A  is  open, 

K*<s,  A)  -  «  s  K  <s.  A)  (5.19) 

II 

i 

for  all  s  if  n  is  sufficiently  large,  while  if  B  is  dosed, 

K  (a,  B)  a  *“(.,  B)  +  c 
n 

for  all  s  if  n  is  sufficiently  large.  The  statement  concerning  r 
closed  sets  follows  from  that  concerning  open  aets  by  taking  com¬ 
plements,  so  onl^  open  sets  need  be  considered.  There  is  no  loss 

* 

in  generality  in  assuming  A  /  8.  By  (5.15),  1^(0  a  T^(t)  for  all 
t  ««  so 

kb(s,  a)  a  o^U) 

k"(s.  A)  +  [U^T) ^  (s)  -  K*(s,  A)]  ♦  C0nnj(«)  -  ^rj^ts)] 
a  K**(s,  A)  -  lo^*)  -  K*V,  A)1  -  -  0^1. 

Since  the  convergence  in  (5.17)  is  monotonia  and  the  limit  is 
continuous,  convergence  is  uniform  by  Dlni' s  theorem.  Choose 
j  so  large  that 

^lV’1  “  K"(’'  ^  <  </J* 

Then  (5.13)  applied  to  f  •  Implies  (5.19)  for  all  s  «  8  if  a 
is  sufficiently  large. 


Q.B.O 
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Theorem  5.1  asserts  that  D(l),  tha  linear  apace  of  f  c  CL 
such  that  U9  ■  i/,  contains  all  constant  functions,  if,  in  addition 
it  is  known  that  D(l)  is  ona  dimensional;  that  is,  tha  only  p  e  CL 
such  that  up  »  p  are  constants,  than  it  can  be  concluded  that  tha 
probability  measure  K  (s,  •)  ■  K*{»)  does  not  depend  on  s.  For, 
by  (d)  of  Laima  5.1,  Ujp  is  a  constant  function  for  any  p  c  CL, 
and  thus  for  any  P  c  C(S).  Therefore,  in  view  of  (5.14)y  for  any 
a,  s'  €  S, 


/s*< t)K*(s,  dt)  -  JSH t)K*'(s',  dt) 

for  all  i>  c  C(S).  This  implies  that  K*(s,  •)  »  K*(s',  •),  •• 

claimed.  It  is,  incidentally,  eaay  to  show  that  *"(•)  i*  the 

unique  stationary  probability  distribution  of  (s  ),  from  which  it 

n 

follows  (ireimen  (I960))  that 

1  n 

11*  J  t  P(S.)  -  «[p(S  >1 
n-sr  n  )-l  1 

with  probability  1,  for  any  9  c  c(S)  and  any  initial  state. 
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c.  Proof  of  Thaoram  2.2 

Suppose  that  a  diatanoa  diminishing  nodal  hat  tha  proparty 
that  tha  aaaociatad  oparator  U  on  CL  hat  no  aiganvaluet  of  modulut 
1  othar  than  1.  Than  Thaoraa  5.1  iapliaa  that 

I*  -  ux(  -  |v“|  *  */(i  +  h)n. 

Tha ra fora,  for  any  ^  c  CL 

|u%  -  0^11  *  K||*|/(l  +  h)n,  (5.20) 

and,  for  any  +  c  C(s), 

lim  |U%  -  0  i\  -  0.  (5.21) 

n-m  1 

fro  a  (5.21)  it  follows  that  con  varga  a  uniformly  to  K**,  juat 

aa  uniform  convarganc*  of  K  to  K*  followed  from  (5.13)  in 

n 

Sub  taction  5b.  That  ia  to  say,  whan  (5.21)  hold*  tha  proof  of 
Lamam  5.3  ramaina  valid  if  and  art  avarywhara  replaced  by 
and  U* .  If,  in  addition,  tha  only  f  c  CL  for  which  Vp  •  f 
•ra  conatanta,  than  K*(a,  •)  -  *"(•)  doaa  not  depend  on  a,  aa  we a 
•hown  in  tha  laat  paragraph  of  Subsection  5b.  Therefore  (5.20) 
reduces  to  (2.9)  (with  C  •  (1  h)N  and  o  -  1/(1  ♦  h|,  and  all 
of  tha  conclusion a  of  Thaoraa  2.2  hold.  To  coaplete  tha  proof  of 
Thaoraa  2.2  it  thus  suffices  to  prove  tha  following  two  laaana. 
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Lanana  S.4.  if  a  c.i  stance  diminishing  modal  satisfies  89, 
the.  1  ia.  the  only  eigenvalue  of  U  of  modulus  1 . 

Lamm*  5. 5.  a  dlatanca  diminishing  modal  satisfies  B9, 
than  tha  only  continuous  aolutiona  of  UV'  -  1>  ara  conatanta. 

Tha  arguraanta  given  below  follow  similar  argument*  by  janiaon 
(1964) . 

Proof  of  Lamm*  5.4.  Suppoee  |Xj  ■  1,  X  /  1,  i>  ¥  0,  and  y  c  D(X) 
ao  that  U*'  *  >  y.  |y{*)|  e  C'S)  ao  thara  ia  an  aQ  c  S? 

|v(*Q)i  *  max  |* (a)  |  *  |yj. 

Clearly  y(aQ)  ¥  0.  Now  uy*  »  xy*  whara  it*  «  y/y( *Q) ,  |y' |  « 

M/I*(«0>l  -  *  •"<!  *'  (*0)  -  !•  uV  -  xV  ao  uV(aQ)  -  XB. 

For  n  ■  l,  2,  ...  let  B  -  {*»  y*  (a)  •  Xn).  clearly  K^(a  B)  - 

n  aapty-  u  n 

1  for  all  nil.  since  K(#Q,  B^)  *  1,  ^  A*  ll*^L*t  a^  fe  Bj.  Than 

if*  (*^)  •  X.  Un/(a1)  -  Xn4lf  and  K{n)(a1,  B^)  -  1.  Jut 

|Xn+1  -  Xn|  *  j X  -  1 1  >0.  Since  y*  is  unifomly  continuous  there 

exists  6  >  0  auch  that  d(a',  a*)  <  6  impliaa  |  #*C  *  * )  -  if*  (a*)|  < 

|X  -  l}.  If  a'  c  B  and  a*  «  B  ,  than  {*♦{■')  -  **(s*)|  •  |X  -  l| 

n  n + * 

so  d {»• ,  a*)  i  6.  Therefor*  d(B  ,  B  . )  ft  6,  n  •  1,  2,  .  ..  But 

n  n+i 

B_  Dt_<0  *nd  Dt  (a.)  ao  d  (T  (aQ) ,  T  (a.))  *  6,  n  •  1,  2, 

n  n  v  n+i.  n  i  n  u  n  i 

...  Thus  H9  ia  violated. 


0.1. D. 
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Proof  Z£SSHL  5.5.  Suppose  that  there  exists  «  real  valued 

non -constant  function  f  «  C'3)  such  that  Vt  •  *.  Let  M  “  sex  f  * 

^(sQ)  and  m  «  min  tfr  ■  Then  M  >  m.  Let  -  {at  *{s)  «  m)  snd 

C  -  {si  H*)  *  «}.  Vni>  -  f,  so  tf  ^(sQ)  *  M  snd  0° *<#.)  -  a. 

Therefore  K*n)  (s0,  ■  1  and  (e^  Ca)  -  1,  so  ^»n(*0)  «><* 

C  3t  is,)  for  sli  n  i  i.  By  t.m  u  iform  continuity  of  #  there 
m  n  1 

exists  a  6  >  0  such  that  d(s,  s')  <6  implies  j^(s)  -  V'(a*)|  <  M  -  ra. 

If  s  c  C„  and  s'  €  C  then  |^(s)  -  *(*•)!  «  M  -  m,  so  d(s,  s')  kb. 

n  m 

Therefore  d(T  (O  *  T  (s_))  a  6  for  all  n  t  1.  ThusH9  is  violated, 
n  0  n  1 

So  under  the  hypotheses  of  Lemma  5.5  chore  is  no  real  valued  non¬ 
constant  i>  €  C(S)  for  which  Uf  ■  i>. 

Suppose  i>'  e  C  (3) ,  Xfi>*  «  i>*.  Then 

U  Re  ti*  +  iU  la  f*  -  Re  if*  +  i  Im  if* . 

Thus  DM’1  -  Re  and  U  im  if*  -  Iin  it* .  But  Re  i>*  and  Im  if*  are 
continuous  and  real  valued*  so  Re  if*  snd  Im  i>*  arm  constants. 

Thus  if*  is  <j  (complex  valued)  constant  function.  Therefore  all 
continuous  solutions  of  -  i>  are  constants.  Q.B.D. 


roof  of  Theorem  2.3 


The  first  paragraph  of  subsection  5c  shows  that,  to  obtain  the 

uniform  convergence  of  r  ^  to  the  limiting  herns!  K*"  of  Theorem  2.1, 

end  to  obtain  (2.10)  with  S  [it  is  )]  defined  ss  in  (2.6),  it  suffices 

s  •>“ 

to  prove  Leona  5.6  below.  All  lexams  in  this  subssetion  refer  to 
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Ltvm  5»e.  u  $f*  3®  eiqsnvalaas of  acd^Xus  1  ether  than  l , 

Proof .  suppose  *  xy  w!*er*  jxj  ■  1,  X  ?'  1,  and  f  s  C(8>.  Let 

a  be  a  state  for  which  {#(a  )|  “  |^i»  and  let  C  »  [st  i>{*)  ■ 
o  v  n 

XniMs0)}i  n  »  1,  2...  Stow  tfV( »0)  *  X*V(*0)>  thus  Kia*(®0*  C^)  - 

1,  and  C  —-  T  (an) .  By  HXO  there  exist*  a  sequence  {t  }  such  that 
n  n  u  r 

t  c  T  (a.)  and  lirr  d{t  ,  s.  ,  . )  *  0.  Bene*  lim  l/(t  )  * 

n  a  0  n-*«  n  j  {»Q)  n-*»ry  n 

iMa.  (y  }).  But  tfl  e  cn,  ao  ^(tn)  ■»  X^CSg) ,  which  converge*  only 
if  t(«0>  “  °*  Hence  j^j  -  1  and  ^(a)  ■  0.  Thus  X  is  not  an 


eigenvalue  of  u. 


QnS»Da 


The  proof  that  S  converges  with  probability  1  must  ba  deferred 
n 

00 

until  more  information  ha&  been  obtained  about  K  .  The  nest  two 


lemmas  provide  such  information,  in  the  work  that  follows,  A  ■ 

(a^t  1  *  i  s  n}  is  the  set  of  absorbing  states. 

Lemma  5.7.  jf  b  ,  . . . ,  b  are  any  N  scalars,  there  is  one  and 
i  N 

only  one  ^  €  C (S)  such  that  u f  $  and  f(a^)  *  b^,  i  »  1 ,  . ..,  N. 

This  function  belongs  to  CL. 

Proof.  (1)  unique  net,  a.  First,  the  following  maximum  modulus 
principle  will  be  proved*  If  ^  e  C  (S)  and  m  1>,  then  all  maxima 


of  occur  on  A  (and  possibly  elsewhere).  Let  s^  be  a  state 

such  that  |^(aQ)|  ■  |^|,  and  let  c  ■  fe*  ^(e)  *  ^(eQ)).  Since 
A(e0)  -  V'fSg),  K(n*(eQ,  c)  -  1,  eo  C  ^Tn(#0)*By  1110  there  exists 
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mnc9  *  *(».  Sut  tR  «  C,  ®o  tR)  ■  ^C*0>. 

Thua  t(BQ)  -  j)»  snd  ^(a,  jM  *  l$S* 

Suppose  ROW  th»fc  jfr,  €  Q  (8)  t  V$  ■  f,  Vif*  •*  f* ,  and  ^(3) 

if*  (a)  for  ell  s  e  A.  Let  ^9*  *  $  -  .  Then  •  «  c(S),  Ut&* 1  *  if*  * , 

and  f*  •  (»J  »  0  for  all  a  e  A.  Thus  |$9,|  *  0,  so  if  (a)  *  if*  (s). 

(2)  Existence.  since  uniKsy  -  B  [*HS  .  )L  U%Cs)  •  itM 
““ .  s  n+i 

for  all  a  c  A  end  i>  €  C(S).  Thus  U, ^(a)  ■  lira  u*V(s)»  ^(s)  for 

1  n-*° 

all  a  c  A. 

Let  a CL  with  to,  (a  , )  »  6 ,  . ,  e.g. , 

1  n  i  3  ij 

^(s)  »  (1  -  e^dfs,  a^)* 

‘f 

where  e  •  min^^d  (a^,  a_. )  and  x  is  x  or  0  depending  on  whether 
x  ??  0  or  s  0.  It  will  now  be  shown  that 

M 

y(m)  -  i21  biU1coi  (s) 

ia  the  function  aought.  Clearly  y  e  CL,  and 


7<«,> 


A 


Wi")1 


N 

i-1 


bi"i<*3) 


Finally , 


Vy 


i-l  Wi  -  i-l  "iVi  -  ’  ■ 


C  •  Be  D« 


L^nm  SeB#  For  i  m  lp  •••$ 


M.  let  yL  be  the  continuous  function 
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•tech  that  7i(aj)  -  6^  iq4  U?i  -  y ^  Than 

k"<.,  •)  -  ri(a)8ai<*), 

whara  fi#,  it  £Sjsl  Sfilfil  probability  mfiJfi\Xt  fit  8  fit  a 

and,  for  anv  i>  €  c(8)  and  all  a  s  *f 

*.[*<».>>  -  jj 

H 

Proof.  For  ^  €  C(S)  3a t  ?'(a)  •  ^  >,  (aJiKa^)  and  »  C^. 
.Claorly  f,  +'  e  C(S),  and  V  (*^)  -  *(a^  ■  j  -  1,  ...,  N. 

Also 

a  n 

W  *  tix  ^(•i)Vr1  -  iSl  ^<aJL)ri  -  9,  and 
Uf*  -  00*  -  t  9  •  **. 

Thus,  by  Lama  5.7,  v  *  V,  which  in  tha  aacond  aaaarfcien  of 
Lmvtt*  5.8. 

Now 

n 

J  iMa»)K**{e,  da*)  -  fi^-KsjJ  -  1S1  7i(a)^(ai) 
s 

*  A 
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for  all  f  «  C(g).  This  yields  the  first  assertion  of  the 

leaaa.  Q.B.o. 

Mow  that  it  is  known  that  K**(s,  •)  is  concantratad  on  A 

and  tha  functions  y ^  ara  available,  probability  1  convergence  of 

S  can  be  proved, 
n 

!<■  5.9.  ISL  fl fly  inltlll  «.  f»n  J  converges  with 

probability  1  to  a  point  8^  q£  A.  For  snv  Borel  subset  B 

fil  8*  *"(•,  »)  -  PS(8W  «  B).  la  particular . 

7i(,)  “  Ps{S«  "  mi)f 


i  -  1,  ....  a. 

Proof.  Xt  is  a  ainple  consequence  of  the  triangle  inequality  that 
the  function  d(«,  A)  on  s  belongs  to  CL,  end  clearly  dfa^,  A)  »  0, 
i  ■  1,  ...,  V.  Thus  B  [d(S  ,  A) ]  *  0,  SO  that 

B*Jd(Sn,  A)  ]  jj  X  can|d(«  ,  A)  il 

for  all  n  £  1.  The  initial  stats  is  zegardsd  as  fixed  throughout 
the  following  discussion.  Since 

*B(d(tn,  A)  J  x  can|d(..  A) | 


it  follows  that 


n$x  8)1  *  C|d (• ,  A)|a^  -  a). 
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By  tha  monotone  convarganc*  thaoram  tha  ordar  of  suamaation  and 
expectation  on  tha  laft  can  ba  intarchangad  to  obtain 

d,s„*  A)1  <  "• 

m 

Tharafora  £  d(S  ,  A)  <  «,  and,  consequently,  lim  d(S  ,  A)  «  0 
n*x  n  n-ao  n 

with  probability  1. 

For  any  i  *  1,  . . . ,  N, 

S.[?i<S„«>lSn . V  ■  VW 'V  ■  W* 


•o  ( r .  (s  )}  is  a  martingale.  Since  it  ia  bounded  (by  |?.|),  it 
x  n  x 

con verges  with  probability  1. 

Let  a  be  the  event  "lim  d (S  ,  A)  »  0  and  lim  y.  (S  )  exists, 

n-ao  n  n-a*  l  n 

i  ■  1,  . ..,  N"  in  the  underlying  sample  apace.  The  above  arguraanta 

show  that  P#(0)  *  1.  Let  id  (  0,  Since  S  is  compact,  every 

subsequence  of  (Sn(o>)]  has  a  convergent  subsequence,  and,  aince 

d  (S  (<d).  A)  -»  0  as  n  -*  all  sub  sequential  limit  points  of  (s  (co) } 
n  n 

are  in  A.  Suppose  that  a^  and  a^, ,  i  J  i*,  are  two  distinct  sub- 

sequential  limit  points  —  say  S  (o>)  -»  a  and  S  .  (u>)  -*  a,  ,  aa 

n^  i  n  ^  l 


7,(8  <®))  -*  y  (a  )  -  1  and 

i  n_.  i  1 

7i<Sn*  7i(*i,)  ’  °» 


j  -•  •».  Then 
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which  contradict*  the  convergence  of  (7.  (8  (cu) ) }.  Thus  all 

&  n 

convergent  subsequences  of  { 8  (to) )  convert  to  the  ease  point  of 

A.  Denote  this  point  8  (to).  It  follows  that  lie  S  (to)  »  s  (to), 

gp  n-»p  n  n 

Therefore  lie  S  ■  S  _  with  probability  1.  This  isqplies  that  the 
n-«»  n  * 

asymptotic  distribution  of  is  the  earns  as  the  distribution  of 

s  ,  i.e,j  k"(s,  B)  *  P  (8  e  B)  for  all  Borel  subsets  B  of  s. 

CO  f  ® 

Finally  7i(s)  *  w  ■  a^  follows  by  taking  B  »  Cis^ ) .  Q.B.D. 

This  completes  the  proof  of  Theorem  2.3. 
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6.  PROOFS  OF  THRO RIMS  CONCIRMISQ  KVUfTS 

a.  Proof  of  Theorem  2.4 

The  equality 

*<"V>  . Vl-1>  *  A'isn>] 


can  be  rewritten  in  the  form 


(6.1) 


where 

*(a)  -  (6.2) 

Thua  (2.12),  with  L  *  C(D+1),  follows  from  (2.9),  (2.10),  and 
the  following  lemma. 

lemma  6.i.  Fax,  tax  <Ui<nuist  diminishing  aaflil  Han  it 
&  ssasJata  D  fash  that 

* 

m(P(1>  (A1))  i  D 
for  all  I  a  1  and  A*  C  I1. 

Proof.  For  any  i,  j  <  1,  a,  a'  c  S,  end  «  Ii+^ 


(6.3) 


Horan 


I  (.,».1+3  . » - 

vv  VV  1  1 

L  %....<••»  ^  ,..,(A.1+3 

v*i  V"*i  1  *• 


where 


*1* * **i  *  ****  Vj)*  (V  **•»  •i+1)  €  Ai+j)» 


L 

Lj  e.. 


(.)  Vp-- 
*  •, 


(e)  e, ...e. 1  rf 
a  i  e 


.*(*f+3  _  ) 


•i  *  .  e, 

*1*  *  **i  1  i 


+  X  to.  _  (•)  - 1  (*•))  #.i+j  » 

.  .  *i— ' *i  v-v  £.  .  > 

"l* #**i  "l****i  1  i 


T'hus 


.•>«.  -  vV^i  * 


L  %  .  <»>!p*4'  r.i^1+3  )-r(1)  (Ai+^  , 

e  l-“*i  *e. .  ..e  "  V«i  *  .  (•' )  e  • .  .e 

•  *®i  1  i  *  1  1  i 


I  <v.. 


*1*’ **i 


I»*1’ 

1  1  •l****i  X*’  i 
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i  a.  )  ?  (a)  u(f  )d(a,  •*)  +  «  d(a,  a*), 

j  Li  ^1*  *  •  ^ 

ci*  *  **i 

whara 

n  -  Bax  ml,*11  (A1)) 

1  *n»3 

and  is  givan  by  (5.4).  (Not*  that  t  m  . ) 

Two  caaaa  ara  now  diatlnguiahad. 

Caaa  li  i  ■  1.  Than 

ipUJ^i+j)  .pU)^!})  *  (n,  ^^Jdta,  a*), 
ao  nj+1  *  nj  +  or,  by  induction, 

n^  t  jm^.  (6.4) 

Caaa  2i  i  »  k,  whara  k  ia  an  inta^ar  that  aatiafiaa  Bft  for  all  a. 

It  waa  ahown  in  tha  proof  of  Thaoraa  5.1  that  aueh  an  Inta^ar  axiata, 
and  that  thara  la  a  constant  0  a  r  <  1  auch  that 


(a)  p(f 
**k  *) 


a  r 


for  all  a  c  S.  Thua 


Vomn 


1  •  **  '  v  ^  H  *  tojr  +  %)<!(•»  •  *), 


V*  •  V  +  V 


(6.5) 


Th±«  fortaxla  and  •  tinpla  induction  on  v  i*>iy 


v  V_1  i 

*j+vk  ‘  V  +  *k  <|fQ  *  > 


for  v  *  o.  Thus 


Vv*  ‘  njr  +  -  r) 


*  j»xr  ♦  w^/U  -  r) 


by  (6.4).  But  any  positive  inta^r  J  can  ba  « 
i  -  vk  ♦  j  for  ao«t  v  a  o  and  0  a  j  <  *.  thua 


rapraaantad  aa 


n|  *  f*  ~  ♦  ^/{l  -  r)  -  o 


for  ail  J  *  1. 


b*  Imt  Ql  Corollary  3.  S 


0.1. D. 


Undar  tha  hypothaaaa  of  tfaa  corollary. 
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P*(Ai)  -  Z  P  *A)  (A1)?  (a)  -  0. 
*  i-1  i  1 


Thus  (2.12)  int>liaa 


|P(n)  (A1)!  *  L an. 


for  nil.  Thu*  the  series  2*  ,  P ^ (A*)  converges  in  the  norm 

n*i  s 

||  •  |j  to  an  element  of  CL,  and 


|  "p(n)(Ai)||  s  I  ||P(n)(A1)U  *  La/(1  -  o). 
n-1  8  n-1  8 


(6.6) 


x  _  1  14  »n . Vl-l*  €  * 

0  if  (*n,  ...,  Bn+|-1)  6*  A1. 


Then  X  -  Lm  ,  X  snd  B  (X  ]  -  P  (n)  (A*)  ,  so 
n-1  n  s  n  s 


*  (Xj  -  2  P(n)  (A1) , 


(6.7) 


for  all  s  e  S.  This,  in  combination  with  (6.6),  gives  (2.14) 
Oesrly 


*<•>  '  W  *  x„> 

n-  2 


-  PU)  (A1)  ♦«[*[£  X  |S  ]) 
8  8  n-2  n  2 
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-  P^1}  (A1)  +  Sb[x<S2)L 

and  •  0,  i  "  1,  JS,  If  x'  is  another  continuous 

function  satisfying  the*#  conditions  then  A  «  x  "  X*  €  C(S)  with 

UA  «  A  and  A(aJ  -  0,  i  *  1,  . N.  Thus  A(»)  z  0  by  Theorem  2.3.  q.e.d. 

c.  Proof  of  Theorem  2.6 

As  was  remarked  in  Section  2,  a  distance  diminishing 
mooal  can  be  regarded  as  an  example  of  a  homogeneous  random  system 
with  complete  connections.  Xn  the  notational  style  of  this  paper 
llosifeecu" s  is  somewhat  different},  a  homogeneous  random  system 
with  complete  connections  is  a  system  ((S,<8),  f,  <p)  such 

that  (S,<8)  er.d  (8,  *>)  are  measurable  spaces,  f.l')  is  »  measurable 
mapping  of  E  S  into  S,  ^(s)  is  i  probability  maasure  on 

A# 

for  each  e  <r  r. ,  and  $A(")  is  a  measurable  real  valued  function  on 

£>  for  each  A  An  associated  atochaatic  process  IE  }  for  such 

n 

a  system  sat  !  fie s 

H3'  p  {e  e  A)  ■  tp.  (s)  and 

8  1  A 


P 

a 


e , ,  1  i  j  s  n) 
3 


^  . 

Vf< 


e 

n 


(o)) 


for  n  i  1,  s  t.  1,  and  A  e  Under  H4,  $  can  be  taken  to  be  the 
Borel  subsets  oi  S.  Under  H2,  >t  can  be  taken  to  be  all  subsets 
of  t,  and  (g)  is  s  probability  measure  on  if  and  only  if 
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there  is  a  nen-negative  real  valued  function  ©  # (a)  on  1  such  that 

©A(«)  ™  2  *  (s) 

A  .  © 

•  €A 

and  L  ©  (a)  *  1.  Then  H3*  is  equivalent  to  S3.  Also,  the 
measurability  requirements  on  f  and  ©^  are  *«>ak*r  than  continuity 

of  f  {*)  and  ©  {•),  which  are,  in  turn,  weaker  than  58  and  H7. 

a  e 

The  following  lemma  waa  proved  (but  not  stated  formally)  by 

Iosifescu  (1963,  Chapter  3,  Section  3). 

Lemma  6.1.  If  e  ho»aogeneous  random  system  with  cowlete 

connections  2m.  tie.  property  :QisJl  Shflltt.  il  £  M3H©PSfi  UJ 

poaitive  nunfoers  ylth  s*-=ln£n  <  "  ***£¥  1  *  1»  2.  probability 

00  /  £ 

ESiiM  p  (a  )  011*3,  ausi.  Lfrajfc 

lP^n>  (A*)  -  P“(A/)|  <  eR 

£21.  fill  a  €  S,  n,  1*1,  and.  A  e  *»  ,  tten  fili  fife  £b&  CQagJjm&qPf 
Si  Theorem  2.6  apply  £o  i2l£  flfiggglfiSfid  UfegghUtifi  BXSSaUB.  (Bn3 - 

The  quantity  P  ^ (A^)  is  defined  in  (2.11). 

0 

PO  ^  00  £ 

For  an  ergodic  model  the  probability  measure  P^fA  )  »  P  (A  ) 
defined  in  (2.13)  does  not  depend  on  s.  Therefore  (2.12)  implies 
the  hypotheses  of  Lemma  6.1  with  en  »  Lan,  end  the  conclusions  of 


Theorem  2.6  follow 
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APPEDDXX*  MODELS  WITH  PI KITS  STATE  SPACES 

a.  Theory 

Tha  following  definition  is  analogous  to  Dofinition  1.1, 
Definition.  4  system  (S,  I,  t,  f)  i*  §.  Halt*.  Itlt*.  Kdtl 

H  ®  mi  »  fix*.  Halt*,  mjul*  f.  {•>  la  a  ambsIasl  fit  *  *  s  iots.  s, 

Md  9.  (*)  ll.  &  gapping  si  I  X  S  it*.  afla-BfflgitlaL  nil  numbers 
such  that  2#cB^(s)  s  1. 

Dofinition  1.2  defines  aasociatod  stochastic  processes  (s^j 

and  (ln)  for  any  such  modal.  It  is  possihla  to  develop,  by  tha 

mathoda  of  Sactions  5  and  6,  a  theory  of  finite  at at*  medals  that 

completely  parallels  tha  theory  of  distance  diminishing  models 

surveyed  in  Section  2.  This  will  not  be  dona  hare,  sine*  tha 

results  concerning  atates  obtained  by  these  relatively  complicated 

methods  are,  if  anything,  slightly  inferior  to  those  that  can  be  ' 

obtained  by  applying  the  well  known  theory  of  finite  Markov  chains 

(see  Kemany  and  Snail  (1960)  and  Faller  (1957))  to  tha  process 

(s  ]*  However,  the  results  concerning  events  in  the  ergcdic  csss 
n 

are  new  and  important.  Tharefcre,  a  development  will  bo  presented 
that  leads  to  the  latter  results  aa  directly  as  possible.  Appli¬ 
cations  to  stimulus-stapling  theory  will  be  given  in  Subsection  b. 
Tha  natural  analogue  of  H9  for  finite  state  model*  is 

H9*  Z2£  SSy  ■»  •*  *  8»  Tn(«)  ^  Tn (*' )  i*  not  empty  if  n  is 

sufficiently  large. 
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This  is  equivalent  to 

H9*  The  flnita  Markov  chain  (Sn)  has  a  single  ergodic  set,  and 
thla  wt  is  ragular. 

Tha  tarminology  for  finite  Markov  chains  used  in  this  appendix 

follows  Kerasny  and  Snell  (1960).  By  analogy  with  Definition  2.2, 

a  finite  state  model  that  satisfies  H9*  will  be  called  ergodic. 

The  reader  should  notej  however,  that  the  associated  process  (3  ) 

n 

need  not  be  an  ergodic  Markov  chain,  since  it  may  have  transient 
states.  If  there  happen  not  to  be  any  transient  states,  the 
chain  is  ergodic  and  regular. 

Lemma  1  is  analogous  to  Theorem  2.3. 

tet  i«  Esz  mu  grq9<ais  guts  agflai,  tlrcce  sis.  sga- 

c  <  ~  tmi  a  <  i  tn4  *  ok »ujJ?y.Uon  k°°  gn  , 

st^ch  that 


(S  )]  -  bU(S  )]|  s  Can| v-|  U) 

•  n  • 

l££  iii  ail  yfluasl  tm&laai  *  aa  s  im  n  »  1,  w umjs. 

*[*(S  ))  “  l  *{s)k"({s}).  (2) 

scS 

t  roof .  Let  fi  -n  the  number-  of  states.  To  lacrlrtatv  t:.^ 


us"  of  matrix  notation  the  states  are  denoted  s  , 


s  . 
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Th«  transition  matrix  P  and  tha 
column  vac tor  f*  corresponding  to  f  arm  than  dmfinmd  by 

Pi)  “  K(V  ,nd  'T  "  (3) 

Than 

E  [  ^  (S  )  1  *  (Pn“V).  (4) 

• .  n  l 


for  n  *  1. 

Thart<  ia  a  stochastic  matrix  A,  all  of  whosa  rows  are  the 
same,  say  (a^,  . ..,  a^) ,  and  there  are  constants  b  <  »  and  a  <  l 
such  that 

!  ’  A^l  a  ban~1  (5) 

for  ail  n  *  1  and  1  s  i,  j  s  N.  When  (S  }  is  regular,  this 

n 

assertion  is  Corollary  4.1.5  of  Kemeny  and  Snell  (1960).  When 

(s  )  has  transient  states,  that  Corollary  can  be  supplemented  by 
n 

Kemeny  and  Snell's  Corollary  3.1.2  and  a  straightforward  additional 

argument  to  obtain  (5).  Let  R*  be  the  probability  measure  on  S 

with  K*({»  ])  ■  a,f  and  let  *{*(S  )’  be  any  coordinate  of  Av*. 

3  3  ** 

Then  (2)  holds  and 

jS  [*(§  >1  -  tU(8  ))1  -  -  (A**)  | 

• .  n  •  1  1 

x 
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-  i  z  r  (pn  -  a  i*  i 
)ml  irV 

*  J!1|(,n‘1>u 

*  Nba"”1 1  . 

Thia  gives  (1)  with  C  -  Nb/a,  Q.B.D. 

The  next  lemma  parallels  Theerem  2.4. 

Lfffllft  2.  For  anv  truodic  finite  state  nodal 

|P>(n)(A<)  -P^A*))  t  r.an  (6) 

Xfi£  ill  n,  til  mg  A1  C  B1..  where 

P*  (A1)  -  Sp^^^asJ),  (7) 

res 

104.  C  and  a  are  aa  in  Ijum  1* 

total*  Jv**t  aa  in  the  proof  of  Theorem  2.4,  (6.1)  and  (6.2) 
hold.  Thus  (6)  follows  from  Leans  1.  n.B.D. 

Theorem  1  is  the  main  result  of  this  subsection. 

XtoUBUl  1-  AH  Pi  the  conclusions  of  Theoren  2.6  hold  for 
M02L  eroodlc  finite  state  n>del. 
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Proof.  A  finite  state  model  can  be  regarded  as  a  homogeneous 
random  system  with  complete  connections,  in  the  same  sense  that  a 
distance  diminishing  model  can  be  so  regarded  (see  the  first  para¬ 
graph  of  Subsection  6c  —  if  $  and  ^  are  taken  to  be  the  collections 
of  all  subsets  of  S  and  S,  respectively,  the  measurability  condi¬ 
tions  in  the  definition  of  such  a  system  evaporate).  Thus  Lemma 
6.1  is  applicable,  and  Theorem  1  follows  from  Lemma  2.  Q.K.D. 

b-  Application  &  g.tlffiulm-8aapllng  Theory 

Consider  the  general  two-choice  situation  described  in  the 
first  paragraph  of  Subsection  3a.  The  state  S^  at  the  beginning  of 
trial  n  for  the  N  element  component  model  with  fixed  sample  size  v 
(Sstea  (1959))  can  be  taken  to  be  the  number  of  stimulus  elements 
conditioned  to  response  A^  at  the  beginning  of  the  trial  Thus 

S  -  (0,  1,  ...»  N),  (8) 

a  finite  eet.  The  event  space  K  can  be  taken  to  be 

«  "  ( (i,  j,  k,  4)s  0  *  i  *  v,  1  i  j,  k  <  2,  0  I  I  i  1)  (9) 

where  i  is  the  number  of  stimulus  elements  in  the  trisl  sample 
conditioned  to  A^,  A^  is  the  response  and  0^  the  trial  outcome, 
and  I  »  1  or  0  depending  on  whether  or  not  conditioning  is  effec¬ 
tive.  The  corresponding  event  operetors  can  be  written 
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f i jll <*)  “  •  +  N  -  a). 


fij2i(»)  -  a  -  snin(v  -  i,  a),  and 


fijkO(fl)  "  ** 


Of  course,  i  elements  conditioned  to  cannot  be  drawn  if  i  > 

N  -  s,  so  the  definition  of  irrelevant  in  this  case. 

The  definition  given  in  (10)  makes  f^^(*)  monotonic  throughout 
S.  The  same  holds  for  (11).  The  operators  given  by  (10)  and  (11) 
are,  incidentally,  analogous  in  form  to  the  linear  operators 


f(p)  «=  p  +  6(1  -  p)  and  g(p)  ®  p  -  0p, 


if  taking  the  minimum  of  two  numbers  is  regarded  as  analogous  to 
multiplying  them.  Finally,  the  corresponding  operator  application 
probabilities  are 


(:) 


(13’ 


where  (p)i  -  B^p  +  6^(1  -  P)  *  CpJ ^  *  6iip  +  6io(1  “  p) '  cjk  i" 

to  be  interpreted  as  the  probability  that  conditioning  is  effective 

if  outcome  O,.  occurs,  and  t  J  )  is  0  unless  0  i  m  *  J. 
jK  \  m  ' 

For  any  choice  of  N  a  v  *  l  and  OS  ^  ,  c  si,  (8)  -  (13) 

define  a  finite  state  model  that  will  be  referred  tc  below  as  a 
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flXtfl  tiflPiA  liSf.  modal. 

Thaoram  2.  A  flxad  sample  site  fflfiflil  with  w  ujd  c, , 


iSL  MJJL  1  *  i*  j  ‘  2  i£  i£32dis.- 


Proof.  It  is  clear  that  if  Sr  <  N  than  the  sample  on  trial 

n  will  contain  elements  conditioned  to  A^  with  positive  probability. 

Given  such  a  sample,  A „  will  occur  and  be  followed  b'»  0.,  with 

2,n  2i,n 

positive  probability,  and  conditioning  will  t  s  effective  with 

positive  probability.  Thus  sn+.^  >  s  with  positive  probability, 

and  it  follows  that  the  state  N  can  be  reached  from  any  state. 

Thus  there  is  only  one  ergodic  set,  and  it  contains  N.  Furthermore, 

if  S  ■■  N  then  A  occurs  with  probability  1,  and  0  follows 
n  1  ,n  li ,  n 

with  porftive  probability.  So  S^+1  =  N  with  positive  probability, 
and  the  ergodic  set  is  regular.  Q.E.D. 

It  follows  from  Theorem i  1  and  i.  that  the  conclusions  of 
Theorem  l.b  are  available  for  any  fixed  sample  size  model  with 

On,  >;  for  all  i  and  i.  Letting  0  be  the  subset 

i}  3-) 

D*  (  (i,  j,  k,  i)  x  j  ■  l1 


of  E,  anJ  *  h (E  ),  where  h  ia  the  indicator  function  of  D, 
n  n 

the  concl  ^ions  of  Theorem  2.6  include  i  law  of  largu  numbers 


and,  possi  a  central  limit  theorem  for  the  number  S  * 

fn,  n 

•  n-1 

A  <  ’  A.  r«.s^or.eea  i.i  the  n  trial  block  starting  on  trial  m. 

3  =m  j  1 

2  2 

A  simple  ^asion  for  c  »  o  can  be  readily  calculated  for  the 

n 

pattern  noc:»  l  <v  “  D  with  equel  under  noncontingenc  reinforccront . 
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Thtoraa  3*  &  fluid  fUGii  lltA  Kdal  with  v  -  1,  0  <  t  - 

ii 

ff2i  “  'i  *  l>  cij  *  c  >  0  U  «iaaflls>  Xht  1*k  at  lun  mabast 

lim  E  [  ( (l/n)S  it  )  2 ]  ■  0 

a  m,n  1 


lad  central  limit  thagna 

S  -  ni  ^  x 

lim  P<  “Jki^ - 1  <  x  )  -  -*z  I  mxp(-t2/2)dt 

n-*n  v  v  n  o  V2ir J  -• 


Mid.  whe  re 

a2  «  ^(l  -  *  )  U  ♦  2(1  -  c)/c). 

Proof .  That 

£[h(E  )  J  *  lira  P(A,  )  -  ir 
*  n-»»  l>n  3. 

2 

follow*  from  (37)  in  Atkinson  and  Estes  (1563).  The  value  O-  ’ 

can  be  obtained  lror.;  (2.17)  and  Atkinson  and  Estes'  formula  (41).  ,E.D. 


The  met? oda  of  thj*  subsection  are  equally  applicable  to  the 
component  mo-i.-.  ju.1!  .lxed  sampling  probabilities  (Estes  (1^59) )  . 
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ON  THE  LINEAR  MODEL 


WITH  TWO  ABSORBING  BARRIERS 
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Abstract 


A  family  of  linear  modal a  for  learning  in  two  choice 
situations  is  considered.  These  models  have  in  common  the 
assumption  that  nonreward  has  no  effect  on  response  probability. 

The  function  y( p)  that  relates  asymptotic  probability  of  one  of 
the  responses  to  its  initial  probability  is  studied  intensively. 

It  is  shown  to  be  closely  related  to  the  total  nurober  x(p)  of 
response  alternations.  The  asymptotic  probability  of  making  the 
less  favorable  response  is  shown  to  be  small  when  the  learning 
rates  associated  with  reward  are  small.  Finally,  soma  of  the 
basic  analytic  function  theoretic  properties  of  y(p)  are  presented. 
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1.  Introduction 

Throughout  this  section  we  consider  a  two  choice  (A^  or  A2) 

animal  learning  experiment  where,  on  any  trial,  response  is 

rollowed  by  reward  (event  £^)  with  probability  t ;  >  0.  Occurrence 

of  A.  or  B,  on  trial  n  is  denoted  A.  or  B.  ,  and  a  subject's 
i  i  i,n  i,n 

probability  of  A^  ^  is  denoted  pR.  When  correction  or  retracing 
is  permitted  after  a  nonreinforced  response  (an  "error"),  asymp¬ 
totic  response  probability  is  found  to  depend  on  a  number  of 
factors,  among  them  the  species  of  the  subject  and  the  type  of 
discrimination  required  (Bitterman,  1965).  In  such  experiments 
reward  has  always  been  set  up  for  exactly  one  of  the  two  responses 
before  each  trial  so  that  reinforcement  is  noncontingent,  i.e. 

«  l-ir^  *  7r‘  Noncorrection  experiments  using  both  contingent 
and  nonccntingen*  reinforcement,  on  the  other  hand,  have  consis¬ 
tently  yielded  very  high  asymptotic  proportions  of  choices  of 
the  response  with  the  highest  probability  of  being  rewarded 
(Bitterman,  Woodinaky,  and  Candland,  1958;  Behrend  and  Bitterman, 
1961;  Brody,  1965;  Msysr,  1960;  Parducci  and  Polt,  1958;  Stanley,  . 
1950;  and  Weinstock,  North,  Brody,  and  LoGuidice,  1965). 

The  possibility  has  often  been  considered  that  the  effect  of 
nonreinforcement  in  these  experiments  migh|  be  nil  or  practically 
nil.  This  assun?tion,  embedded  within  linear  learning  models,  has 
interesting  consequences.  When  coupled  with  the  auxiliary 
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assumption  that  a  rain forced  correction  response  has  the  same 
affect  on  response  probability  as  a  reinforced  original  response, 
it  leads  to  the  transition  rules 


'n+1 


r<i-9)pn  + • if  v„v„  or  a2jB*JjB 
i  U-9)pn  if  or  A1jB*1)B 


(1) 


for  p  .  The  probabilities  that  the  first  and  second  rows  are 
n 

applicable  are  easily  seen  to  be  it  and  1-ir  respectively.  This 
model  is  quite  useful  psychologically  and  quite  well  understood 
mathematically.  It  predicts  probability  matching: 


lia  P  (A.  ) 

n-M»  p  l,n 


7 r 


for  any  value  p  of  p^. 

The  corresponding  linear  model  for  the  noncorrection  experi¬ 
ment  has  transition  equations 


(l-ejp  +  9, 

if  A.  E, 

I  n  1 

l,n  l,n 

if  A,  B, 
2,n  2,n 

p 

if  A.  i. 

n 

l,n  l,n 

(2) 


1  *  *or  ***•  of  the  present  discussion  we  suppose 

that  m  d2  “  9$  but  th#  mor*  9«n#ral  which  might  arise, 

for  instance,  if  the  magnitudes  or  delays  of  the  rewards  8^  and  B 
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were  unequal,  is  also  treated  below.  The  identity  operator  or 

two  absorbing  barrier  linear  model  (2)  has  been  considered  by  many 

investigators  (Bush  and  Mosteller,  1955,  pp.  291-294;  Brody,  1965; 

Weinstock,  e£.  a^.,  1965)  in  this  setting  with  considerable  success. 

(See  the  last  two  paragraphs  of  this  paper  for  an  observation  that  suggests  that 
the  model  may  have  been  more  successful  than  the  authors  of  the  latter  paper 
realised.)  There  are  no  special  difficulties  in  deriving  predictions  from 
this  model  either  by  mathematical  approximation  techniques  (Bush 

and  hosteller,  1955,  pp.  286-291;  Mosteller  and  Tatsuoka,  1960) 

or  by  Monte  Carlo  methods  (Brody,  1965;  Weinstock,  et  al.  1965). 

Nevertheless  there  are  some  large  gaps  in  our  knowledge  about 

this  model.  The  purpose  of  this  paper  is  to  fill  some  of  these. 

The  quantities 

yip)  «  lim  P  (A  )  (3) 

n-*»  p  i,n 

and 

X(p)  11  B  [total  number  of  response  alternations]  (4) 

P 

are  of  basic  interest.  There  are  practically  no  cases  (i.e. 
special  parameter  values)  for  which  simple  formulas  for  y  and  x 
are  known.  The  research  reported  below  began  as  an  attempt  to 
provide  the  first  proof  that,  in  the  equal  theta  c»««,  the  asymp¬ 
totic  probability  of  choosing  the  unfavorable  side  is  omall,  at 
least  when  9  is  small;  that  is,  y  (p)  -»  0  as  9  -*  0  if  >  rr  , 

9^  *  ^  *  9»  and  0  s  p  <  1.  It  developed  into  a  fairly  general 
investigation  of  the  functions  y  and  x*  The  lemmas  of  Section  2 


provide  the  foundation  for  the  subeequent  development.  The  eein 

%  * 

result  of  Section  3  is  a  relation  between  x  and  y  that  essentially 
reduces  the  study  of  the  foreer  to  that  of  the  latter.  The  two 
special  casea  »  r2  -  1  and  0 ^  -  9^  of  thia  result  are  easily 
derivable  fro*  formulas  relating  the  asymptotic  probability  of 
to  the  total  nuaber  of  runs  of  A1's  that  Bush  (1959,  Sections  5 
and  6)  obtained  by  another  Method.  In  Section  4  bounds  are  obtained 
for  y  that  are  sufficiently  preciee  to  permit  deduction  that  the 
asymptotic  probability  of  choosing  the  unfavorable  side  is  small  when 
the  learning  rates  are  email.  The  theorem  of  Section  5  ia  concerned 
with  the  analytic  character  of  y.  Most  of  the  results  of  that  section 
are  extensions  or  refinements  of  those  of  Karlin  (1953,  Sections  1 

and  5),  who  concentrated  on  the  case  ^  •  Tj  ■  1. 

2.  Fundamentals 

Throughout  the  remainder  of  the  paper  we  will  be  concerned 
with  the  identity  operator  model  (2)  under  the  condition* 

1  *  6l,92'*l,1r2  *  °* 

For  any  (real  valued)  function  ^  on  fO,l]  we  define  |#|  by 

1*1  "  »up  |*(p) | . 

OSptl 

We  let  D  be  the  class  of  all  differentiable  functions  with 
bounded  derivative  on  [0,1],  Such  functions  are  necessarily 
"uMnuors.  If  f  «  D,  its  norm  |p|  is  defined  by 
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Thus  the  relation  lim  ||*  -  1>  ||  -  0  for  i/  ,i>  c  D  ia  tantamount 

n-*»  n  n 

to  the  uniform  convergence  of  %  to  f  and  of  i>%  to  f  .  The 

n  n 

operator 


Vi/(p)  -  B^(Pn+1HPn  "  p),  *••• 

UiK p)  -  i>(  (1-0  Jp-t^)  ^p  +  it((l-92)p)  ff2(l-p)  +  *(p)  (l-TjP-Tjd-p) )  (5) 

maps  o  into  D  and  ia  linear  and  positive  (that  is,  it  preserves 
nonnegative  functions) .  It  can  be  shown  by  a  simple  inductive 
argument  that 

u%(p)  *  8p^(Pn+i^>  (6) 

0  *  p  *  1,  where  0°  is  the  nth  iterate  of  the  operator  U. 

lemmas  1  and  2  below  follow  easily  from  the  results  of 
Norman  (1967,  see  especially  Subsection  a  of  Section  3). 

fatataft  1  •  12ml  gftiY  absorbing  iliitl  {barriers)  si  iM  *flK?v 
kkmi  (PnC-i  ***■  °  tod  i.  £2is.  EiasuBt  ggHYimu  ?ASh  smb afelUix 
i  is.  i  jandfia  jJusihins  Mi  p„-  Xbt  Mffitaasi  lPp(Ai,n^n-i  ssn~ 
ami  u.  n  -*  •  ia 

7  (p)  *  pp{P„  "  D  •  (?) 

12m.  fmicUan  7  frllgpqi  £2  D  li  t h«»  only  continuous  solution 
sl  ifci  imsiisMl  tt.ai*U.9n 
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Vy  ■  y 


(8) 


that  Mi  sha  boundary  valuaa  ?(0)  »  0  and  y(l)  «  l.  Tha  function 


7,  x(p)  •  a  <1-7(0))  +  b7  (p) 

■fD 


la  S&L  soix  ssaUmism  faction  auch  tMi  b  -  ?a<b, 

7a  b(0)  “  •»  7a  b(l)  •  b  Thtra  l£4  a  <  1  jmi  C  <  -  ia<ill 

I  *.[*(pn>]  -  *.U<pJl  I)  *  Can  (1^8  -xi) 


for  oil  f  c  D  ind  nil. 

lam.  2.  Du.  total  nuflbtr  *  si  altarniUpni  s^jsmh  ulmssrsul 
la  finlta  KiUl  probability  1.  la  Hal,,  lax  aim  u  «  p  f  1, 

«p[Y)  <  Da.  fimsUsn  X<p)  -  *pW  tadsim  £<*  »  mo.  Urn 
UfliSUf  continuoui  MlttUsa.  Si  SllM.  luaSSASWU.  •SS&&1&B. 

x(p)  -  (J*VrSVp*11?)  +  v*tp*  uo> 


for  which  x(0)  *  X<1)  “  0. 

Lot  X  bo  tho  indicator  random  variable  for  tha  avant  A 

n 

Sinca 


Y  •  J Jx.n  -  x,<  ' 


1  ,n‘ 


it  follova  that  (Xft)  con va T9* a  with  probability  1.  Lot  bo  tha 
limiting  random  variabla.  Claarly  X^  <  (0,1)  with  probability  l. 
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and  it  is  plausible  that  with  probability  1.  To  prove 

this  we  note  that 

P  (X  /  p  )  *  P  (X  -  1,  p  -  G,  +  P  (X  -  0,  p  *  1) 

p  00  oo  p  oo  oo  poo  m 

-  E  fx  (1-p  )  ]  +  E  [  (1-X  )P  ] 

P  OO  oo  p  00  JO 

*  lim  E  [X  (1-p  )]  +  lirn  E  [(1-X  )p  ) 

p  n  rn  p  n  n 

n-*oo  r  n-»» 

*  lim  E  [B[X  (l-pn)|pnJJ  +  lim  B  [e[  (1-X^p^p^]  } 

-  lim  E  [(1-p  )B[X  ipj]  +  lim  E  ( p„  (l-*txn  t pj )  1 

n-»»  ^  *  n-*« 

-  2  lim  E _f  <l-pn)p  }  «  2E  [{x-pjpj  -  0. 

n-*»  v  v 

Therefore  >  (p)  has  the  following  behasdoral  interpretation! 

>  (p)  *  P  (X  *»  1) , 

*  p  Ow 

Since  X  is  an  indicator »  X  ■  1  m^ans  that  X  *  1  for  all  but  a 
n  m  n 

finite  -.umber  of  n.  i.e.  A  for  all  but  a  finite  number  of  n. 

l.n 

A  standard  notation  for  the  latter  event  as  lim  inf  A,  .  Tbere- 

l,n 

fore  > (p)  *  P  (lim  inf  A,  ). 

p  l.n  a 

If  *  t  c  with  *(0)  »  o  and  i(l)  *  1  (e.g.  *(p)  »  p)  then 

£  [s'(p  )3  ■  * (0) (1-? (p> )  +  V<U?(p)  *  7  (p) . 

P  " 


Thus  (9)  and  (6)  imply  that  !|  U*V  -  7 II  converges  geometrically 
to  0  as  n  -*  •».  This  gives  us  an  iterative  method  of  approximating 
7  that  should  be  useful  in  numerical  computations. 

The  functions  y  and  X  andthe  operator  U  depend  on  four  para¬ 
meters.  Ahen  it  is  necessary  to  call  attention  to  the  dependence 
on  acme  of  these  parameters  we  use  notations  such  as 

7  (p:9-,  and  Ua  .  The  study  of  the  dependence 

12  12 

cf  7  upon  its  parameters  is  considerably  simplified  by  the  follow¬ 
ing  lemma  whicn  states  the  obvious  fact  that  the  probability  of 
absorption  on  response  A^  when  the  parameters  are  9^,  B^,  n ,  and 
t r2  and  p  is  the  initial  probability  of  A^  is  the  same  as  the  prob¬ 
ability  of  absorption  on  when  the  parameters  are  B^,  B^, 
and  7r  and  p  is  the  initial  probability  of  A^  so  that  1-p  is  the 
initial  probability  of  A^. 

Lemma  3. 

y{pie1,92,  *l,*2)  “  l  - 

Another  proof  of  this  equality  is  obtained  by  verifying  that  the 
function 

♦  (p)  ■  i  -  7 


belongs  to  D,  a  id  satisfies  the  functional  equation 
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and  the  boundary  conditions  * (0)  *  0,  *(1)  -  1.  We  use  a  similar 
method  to  prove  the  following  lemma,  which  ia  only  slightly  leas 
obvious  than  the  precading  one. 

Lemma  4.  Fox  my  0  <  x  s  l/bax(ir  ,ir2) 

y  ipixyr^^vj  - 

In  other  words,  y  depends  on  ir^  and  only  through  their  ratio. 

Proof.  The  functional  equation  (8)  for  ^(p;^,^)  is  equiva¬ 
lent  to 

(v  P  +  Tr2(l-p))y(pfir^tw2) 

-  ^pytd-^Jp  +  ir2)  +  7r2(l-p)r((l-02)p;7r1,v2). 

Multiplying  both  sides  by  x  we  see  that  satisfies  an 

equation  equivalent  to  the  functional  equation  for  y  (p; xrr^xvj O.B.D. 

In  particular,  the  absorption  probability  y{ p;V, v)  for  the  case 

^1  *  V2  “  v  d°®8  n0t  d®P®nd  on  v*  * 

We  now  describe  a  method  that  permits  us  to  obtain  bounds  on 

the  solutions  ?  and  x  of  the  functional  equations  (8)  and  (10)  by  * 
solving  corresponding  functional  inequalities. 

Dfe.Ciai.tlQa«  A  function  V/  on  [0,1]  is  superrecrular  (regular, 
sub regular)  if  aQil  only  ii  1>{p)  %  (®,  s)  tty(p)  for  all  0  *  p  s  1. 

These  concepts  are  standard  in  the  potential  theory  of  Markov 
processes  (see,  for  instance,  Kemeny,  Snell,  and  Knapp  (1966)). 
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The  next  leone  shows  the  usefulr^ss  of  these  notions  and  justifies 
the  terminology. 

Luma.  5.  1*1  MDtn  sgperreoular  (mhmgaJjuJ  3fiSll  *(0)  «-  a 

and  *(1)  -  b.  HjfiH  ♦( p)  *  (*)  b(p)>  where  7a  b(p)  i JSL  iilS.  £2Er 

tinuous  xaaaiai.  fcmsllaa  with  7a<b(o)  *  a  7a#bd)  *  b. 

Proof.  Since  ^(p)  2  (*)  U^(p)  and  l/1  i»  a  positive  linear 
orator*  w"^{p)  *  (<)  a"+V(p)  for  all  n  t  0.  But 

lim  l^iMp)  «  K  UCpJl  ■  7a  b(p) 
n-*»  p  * 

by  Lemna  1.  Thus  ^(p)  *  (*)  7  .  (p)  for  all  0  t  p  *  1.  O.E.D. 

Lemma  5  implies  the  slightly  more  general  Lemma  6. 

Lemna  6.  Suppose  flat  ♦<  and.  ?  £  D(  AJJL  J fllHi. .iMgttePf 

> 

vanish  0  jjQd  &£  1,  jad 

*(p)  *  (*)  g(p)  +  yf(p), 

ythilt 


^(p)  ■  g(p)  +  u*(p). 

Then  tf(p)  i  {*)  * (p)  for  £11.  0  *  p  *  1. 

Proof.  Let  A(p)  “  *(p)  -  *(p).  Then  A  c  D,  A  Is  superregular 
(subregular) ,  and  A(0)  “  A(l)  ■  0.  Hence  by  Lemna  5, 

A(p)  *  («)  70  Q(p)  "  0 


13 


for  all  0  *  p  s  1.  Q.B.D. 

We  remark  that  Lemmas  5  and  6  generalize  immediately  to  the 
general  absorbing  distance  diminishing  models  treated  by  Norman 
(1967). 


3.  Alternations 


Let  I  be  the  identity  function  on  the  real  lines  1 (p)  z  p, 
and  let  B  (p)  *  p(l-p).  A  simple  computation  yields  the  following 
important  result. 

Lemma  7.  U1  ■  I  +  (0  t  -  0 

Taking  0j.it  *  92*2  and  usin9  characterization  of  y  given  in 
Lemma  1,  we  obtain 

SfafiaCMB  1*  *  92^2,  then  y  -  I. 

When  9j.it  /  ®2T2  Jjenuwl1  7  le«?ds  to  a  relation  between  y  and  x» 

Ihaprern  2.  ll  0^  /  0^,  then 


X 


(I .  , 

Vrt'i  r 


(11) 


Proof.  Let  the  function  on  the  right  be  denoted  F.  Since 
1  and  y  are  continuous,  P  is  too.  And  since  1  and  y  agree  at  0 
and  1,  F(0)  ■  F(l)  *  0.  Finally 


4  ■: 


by  (8)  and  Lamms  7.  So 


UF  -  f  -  (2“917r1“92Tr2,B’ 

which  is  equivalent  to  (10).  Thus  Lemma  2  implies  X  ■  E.  Q.B.D. 

By  evaluatinq  both  sides  of  (11)  at  the  random  point  and 
taking  expectations  we  obtain  a  relation 


K[Y] 


a-Vrya  (p(\.i>  -p(p. 

8  _ir  -0,  n. 

2  2  11 


D) 


between  the  three  quantities  E[Y],  P (A.  . ) ,  and  P (p  ■  1)  that 

1)1'  “ 

does  not  depend  on  the  distribution  of  p  .  This  relation  can  be 
tested  empirically  once  and  9^  have  been  estimated.  Alter¬ 
natively)  when  m  62  m  it  can  138  U8®d  to  estimate  8.  since 
the  abt»vL.)tion  probabilitiea  are  not  changed  when  and  ir^  are 

both  nuj.«.iplied  by  the  same  constant,  the  quantity  P(A  )  - 

*1* 

P(p.  -  1)  car*,  be  cancelled  to  obtain 


iil'llJCj-  .  il 

«<X’:CY]  X 


(2  -  x<®1ir1+02ir2^ 

(2  -  xM8  r+8  r  ))' 
11  *2 
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where  the  reinforcement  probabilities  corresponding  to  K^[Y) 
are  zir^  and  zTj,  In  particular,  if  x  and  x*  are  samll,  or  6^  and 
0^  are  small,  or  some  combination  of  the  two, 

B(x^  [Yl  •  X1. 

s(x')tY)  * 

In  Section  4  we  use  (11)  and  our  results  on  the  asymptotic 
behavior  of  y  as  the  thetas  become  small  to  obtain  an  asymptotic 
expression  for  x« 

The  next  lemma  will  help  us  derive  bounds  for  X  when  - 

Vj- 

Lemma  8.  I£  9^  -  9^  then 

(1  -  ^^maxt*^,*^)  )B(p)  *  UB(p)  «  (1  -  e1®2B‘in*irl,ir2^B^ . 

Proof.  A  etraiorhtforward  computation  shows  that,  for  any 
®1*  92>  »*»d  *2 

OT(p)  »  B (p)  (l  -  [®1x1-82(l-02)ir2]p  -  (1-p) ), 

•e 

When  8^  »  9  2*  2  *****  r*duca*  to 

UB(p)  -  B (p)  ( 1  -  92w&  “  ***1  (W))* 

Since  SjXj  -  #nd  ®lTi  “  9192*2  th*  *•■■■  follows.  Q.B.D. 

Itoama  3-  U.  9l\  -  ®2*2#  iton. 


(12) 


i 
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2—0  Tf  -0  TT  2-0  TT  -0  F 

—  i. JL  -?-L. .  B{p)  *  y(D)  s  _1  b(d) 

O^maxU^F^  P  0]L02min(F1,F2)  P  ’ 


Proof.  It  follows  from  Lemma  8  that  the  function  F (p)  on 
the  right  (left)  satisfies 

F(p)  *  («)  (2-01fi-02f2)b(p)  +  UF(p). 

Since  these  are  the  functional  inequalities  corresponding  to  (10) , 
an  application  of  Lemma  6  completes  the  proof.  Q.E.D. 

These  bounds  are  tight  when  f^-f2  Sma-L^L*  ln  the  impor¬ 
tant  special  case  *  *2  (and  9j.  -  02)  they  yield  x  exactly. 
C9E9U"jy-  whan  -  ®2  -  ®  \  -  *2  -  v, 

y  .  B. 

*  2 
0  V 


f 


*•  sjsaJLl  ksaiaJja  Rates 

In  this  section  we  will  show  that  if  and  f2  are  fixed  and 

(0^,02)  approaches  (0,0)  along  a  line  in  the  9  f  9^  plane  for 

which  ®21f2  ^  ^  1*1*  then  ®  for  a^1  0  *  p  <  1.  More¬ 
over  the  convergence  is  extremely  rapid.  The  inequality  ^ 

^11  >n*V  h®  thought  of  as  indicating  that  *2  is  the  scat  favor¬ 
able  response.  When  9^  ■  9^  this  inequality  reduces  to  f2  >  f 


17. 


as,  intuitively,  it  should.  Analogous  results  for  >  ^2W2 

may  be  obtained  by  applying  the  results  established  below  to 

7  (1-p?  d2,d±>  n2‘  Fl*  3nd  uaing  Lenuna  3* 

We  assume  throughout  this  section  that  *8  confined 

to  a  line  through  the  origin  in  the  0^,  9^  plane  on  which  32w2  ^ 

.  This  line  is  characterized  by  the  ratio  £  *  and 

9 1  and  9  ^  are  of  the  form 

®1  *  *2  “  9  (13) 
0  <  9  s  min(l,l/r).  Clearly  £  >  0  and,  if 

<*>  -  TT^/lTj, 


then 


1  >  toll. 


(14) 


Por  any  x,9  >  0,  let  1/  n  be  the  function  on  [0,1]  defined  by 


»„..w  ■  •“p/9- 


Most  of  our  effort  in  this  section  goes  into  the  proof  of  the 
following  lemma. 

id  Jim  9.  Xhffjre.  «£4.  pcsitive  constants  y  «  y(o>,C>  4M 
*  “  task  that  1>  9  it  aiifeltaalat  1M  9  ia.  saperrecular 

for  all  0  <  9  %  rain(l, l/{) . 


18. 


Proof.  For  any  x  >  0  and  0  *  p  *  1 
(p) 

xp/0-  x8  .(l-p)/8  -x9jp/8  "j 

■  a  ja  xxp  +  •  fjd-p)  +  (l-^p-Xjd-p))  J 

r  xC(l-p)  -xp  1 

-  a(p)|l  +  (a  -  1)  WjP  -  (1  -  a  )t2(1-p)J 

by  (13) .  Thus  *  a  is  subragular  (suparragular)  for  all  0  <  9  s 
x,  w 

rain (1,1/C)  i*  and  only  if 

(#*C(l-p)  .  1)TiP  *  (*>  (1  -  a’^XjU-p)  (15) 

for  all  0  *  p  i  1.  (Tha  raadar  should  nota  that  thasa  inaqualitias 
do  not  involvs  8.)  Hows vs r  tha  diffaranca  batwaan  tha  two  sidas 
of  (15)  is  continuous  throughout  [0,1]*  tharafora  thasa  inaqualitias 
hold  throughout  (0,1}  if  and  only  if  thay  hold  throughout  (0,1) » 

% 

i.a. 


*  («) 


1 

0) 


for  all  0  <  p  <  1.  In  tarns  of  tha  function  V  on  (-•,»)  dafinad 
by 


V(u) 


-  l)Ai  if  u  /  0 

(16) 

1  if  u  *  0, 
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f  a  is  subregular  (superregular)  for  all  0  <  9  *  min  (1,1/0  if 
x»“ 


and  only  if 


f,<e.x>  -  «  («) 


for  all  0  <  p  <  1. 


V(u)  *  /  e^dw 


for  all  real  u.  Since  the  integrand  ia  positive,  V(u)  >0  for  all 
real  u.  By  taking  the  derivatives  under  the  integral  sign  we  see 


V(k)  (u) 


f  1  k  x 

■  /  we 


for  k  *  1  or  2.  Thus  V’ (u)  >  0  for  all  real  u,  so  V  is  strictly 
increasing.  It  follows  that 


H(u)  *  lnV(u) 


is  also  strictly  increasing.  Furthermore 


H"(u) 


i 


V2(u) 


20. 


(w  -  V'  iu)A(u)} 


wA  (u) 


so  H*(u)  >  0  for  all  raal  u,  and  H  is  convex. 

Writing 

Aj,(p,x)  -  H(^x (1-p) )  -  H(-xp),  (19) 

(17)  and  (18)  give 

A  (p,x) 

r^(p,x)  -  a  .  (20* 

We  suat  now  distinguish  two  cases. 

Casa  1.  C  *  1.  For  0  s  p  *  1, 

(d/dp) A.(p,x)  *  ~k{CH' (Cx(l-p) )  -HM-xp)].  (21) 

) 

Since  H'(Cx(l-p))  *  0,  and  C  *  1 

CH'  (Cx(l-p))  -  H' (*xp)  i  HMCx(l-p))  -  H*  (-xp) 

i  0 

since  the  convexity  of  H  implies  that  H*  is  nondecreasing. 

Squation  (21)  then  yields 

(dAp)A^(p,x)  *  0, 


so  that 


Af(l,x)  *  A^fpjX)  S  A^(0,x) 


21. 


for  all  0  <  p  <  1.  But  A^(l,x)  *  -H(-x)  and  A^(0,x)  »  H(£x),  ao 

-H(-x)  i  A,(p,x)  s  H (£x) 

or 

a  ff(p,x)  t  V(Cx) 

for  «11  C  <  p  <  .1  and  x  >  0. 

Case  2.  r  i  1.  In  this  csss 

A,  (p,C*)  *  A.  (p,x)  *  A  (p,x) 

*  s 

since  H  is  nondeci easing.  We  sr  ■  in  case  i,  though,  that  ,x) 

is  nonincressing.  Hence 

A^  (1,  £x)  t  A^  Ip,  x)  i  A^O,*}, 

i.e. 

-H(-£x)  »  A^(p,x)  *  H(x). 

Therefore 


(22) 


lp,x>  1  v(,° 


23} 
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for  all  0  <  p  <  1  and  x  >  0. 

Returning  for  tha  moment  to  the  general  case,  note  that 
lim  V(u)  ■  0,  V(0)  "  1,  and  lim  V(u)  ■  »,  and  recall 
t^at  V  la  continuous  and  strictly  increasing.  Since  Qm  <  1,  the 
equation 

V  {x# )  -  1/C<d  (24) 

has  a  unique  root  x‘  »  x'  (cu,  £)  in  (0,«) ,  while  the  equation 

V(x")  ■  (25) 

has  a  unique  root  x"  ■  x"(<u,£)  in  (-«o,0).  Now  consider  again  the 
cases  discussed  above.  In  case  1  ((  t  1)  let 

y  ■  -x*  and  *  ■  x*/C*  (26 

> 

Then  from  (22), 


*  fj.(p,v' 


while 


fj,(p,z)  i  V(Cz)  -  V(x') 
for  all  0  <  p  <  1.  In  case  2  ((  i  1)  let 


{ ?l ) 


y  ■  -xVC  and  a  ■  x* . 
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Thar,  from  (23) 


*  f^P^y)  f^(p>z)  *  V(x8) 
for  all  0  <  p  <  1.  Therefore,  in  either  case, 


&  *  ff.(p,Y)  and  f;(p,a)  *  ^ 

for  all  0  <  p  <  1.  Referring  back  to  the  sentence  containing 
equation  (17)  we  sae  that  f  g  is  subrejular  and  g  is  super¬ 
regular  for  all  0  <  9  i  min (1, l/£) .  Q.2.D. 


Though  we  will  not  need  them  below,  we  note  that  the  proof 
gives  simple  formulas  for  y  and  z.  The  function  V  is  defined  by 
(16).  The  points  x'  and  x"  are  defined  in  terms  of  V  by  (24)  and 
(25),  and  y  and  z  are  defined  in  terms  of  x*  and  x"  by  (26)  when 
£  a  1  and  by  (27)  when  £  S  1. 

It  is  easy  to  see  that  the  classes  of  superregular  and  sub¬ 
regular  functions  are  closed  under  addition  and  multiplication 
by  nonnegative  constants.  Further,  the  constant  functions  are 
regular,  hence  both  superragular  and  subregular.  For  any  x  >  0, 
i/  fl(l)  >  f  a  (0)  »  1,  therefore 


K  9(p)  ~  1 


is  supexieguiar  or  subregular  if  g  in.  Also  a  c  D,  with 


x,9 


(28) 


24. 


♦  a(0}  *  0  and  ♦  fl(l)  -  1.  Thus,  combining  Lem  a  9  and  Lemma  5 

XtO  XjB 

we  obtain  the  following  theorem. 

Theorem  4.  There  PQ»ltlve  constants  y  -  y(u>,0  sM 
a  »  *(<0,0  each  that 

*y>0(p)  g  7( *  *2  9(p)  (29) 

IflJL  all  0  <  5  i  min  (1,1/0  and  C  i  p  s  l. 

Corollary.  For  any  0  <  9  *  min  (1,1/0  find  Oiptl 

y(p)  i  l/e*(1'p)/9,  (30) 


MS.  ihftl 


uV*or(p)  "  0 

> 

if  0  «  p  <  1. 

Proof.  A  simple  calculation  shows  that 


(p) 


_ i _ 

#*(l-p)/9 


(31) 


and  the  second  factor  on  the  right  clearly  does  not  exceed 
unity.  Q.E.D. 

Equation  (30)  suggests  that  when  the  learning  rates  are  small  the 
probability  of  being  absorbed  on  the  unfavorable  side  is  very  small. 
Combining  (31)  with  Theorem  2  we  immediately  obtain 


-  • 


25 


Theorem  5.  For  0  <  p  <  2. 


X(P) 


Jr. 


d  T r  “9  TT 
2  2  11 


as  9  -»  Q. 

Thus  when  9 ^  >  9^7r^  (or,  more  generally,  when  9^77^  /  9^tt^)  ancl 
9  is  small,  the  mean  number  of  alternations  is  of  the  order  of 
magnitude  of  1/9,  and  tends  to  be  inversely  proportional  to  the 
difference  | ^2n2~^l ^1 ^  **  favorability  between  the  two  responses. 
When  9  t r  *  9  tt  and  9  is  small  the  mean  number  of  alternations  is 

*  *  XX 

2 

of  the  order  of  magnitude  of  1/9  by  Theorem  J.  Thus  if  the 
learning  rates  are  small  we  expect  many  more  alternations  when 
the  two  responses  are  approximately  equally  favorable  than  when 
one  is  much  more  favorable  than  the  other. 


b.  y  j|ja  ari  Analytic  Function 

In  this  section  we  assume  that  the  reiader  is  familiar  with 
the  elements  of  the  theory  of  analytic  functions  as  presented,  for 
example,  by  Knopp  (1945),  whose  terminology  we  follow. 

In  order  to  motivate  our  results,  consider  the  case  9^  •  1, 

9^  <  1.  Then,  since  7(0)  -  0,  (8)  reduces  to 

/(p)  ■  7(g1(p))^1p/[ff1p9-^aU-p.‘ ]  (3 


where 


26. 


^(p)  (1-0  )  p  +  9^ 

(We  will  also  un  tha  notation 

92(p)  ■  (l~02)p 

below. )  Iterating  (32),  and  Trailing  that  y  is  continuous  with 
7(1)  11  1,  we  obtain 

7(p)  *  A(p)/j(p) 

where 

» 

00 

Mp)  «  n  g  (p), 
n-0  i,n 

•  7T 

J(p)  -  n  [g  (p)  +  -*  (1-g  (p))l, 

n-0  1,n  *1  1»" 

91>0(P)  *  P*  and  gA  is  the  nth  iterate  of  gj ,  n  a  L  Since 
91<n(p)  ■  1  “  (l-0^)n (1-p) ,  n  a  0,  the  infinite  products  A(p)  and 
J(p)  converge  for  all  complex  numbers  p.  So  the  formula  (33) 
serves  to  continue  y  analytically  into  the  complex  plane  C.  If 
*2  "  then  J(P)  ■  1  and  7  i«  antire.  If  /  *  then  y  has  a 
pole  of  order  1  at  each  point  that  is  a  aero  of  one  of  the  factors 
of  J(p).  Since  ^p  +  (1-p)  -  0  if  and  only  if  p  -  c  where 

C  “  V<W- 


(33) 
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these  zeros  are  just  the  points  g  1  (c)  that  g  maps  into  c 

1  i  n  X  f  n 

for  some  n  *  0.  Note  that  c  >  1  or  c  <  0  depending  on  whether 
,r  >  v  or  Jr,  <■  tt  .  and  that  the  sequence  c,  g.  (c) ,  g,  ,(c),  ... 

212  . i  1  1|« 

of  po.  ■'s  is  confined  to  [c,<»)  or  (-*>,cl,  respectively,  in  these 
two  ca set . 

These  examples  «nd  that  given  by  Theorem  1  suggest  all  of 
the  possibilities  f^r  the  qualitative  analytic  character  of  7 
that  arise  in  the  geir  ral  case  treated  in  Theorem  6,  which 
summarizes  our  results.  The  theorem  shows  that  7  is  always 
meromorphic  and  occasionally  entire.  Whenever  there  are  poles, 
c  is  the  one  closest  to  [0,i]. 

If  c  is  a  pole  and  its  distance  from  [C,l]  is  less  than  1 
then  there  will  be  points  x  in  [0,l]  such  that  the  Taylor  series 

®  (n)  .  . 

7(p)  =  S  2— (p-x)n 

On  • 

about  x  does  not  converge  for  all  0  s  p  *  1.  In  such  cases  an 
attempt  to  compute  the  sequence  {>  ^  (x) )  by  the  conventional 
method  of  substituting  (34)  into  (8)  and  equating  coefficients  o 
(p-x)n  on  the  two  3ides  of  the  resulting  equation  seems  doomed  ti 
failure.  In  fact,  little  progress  has  been  made  to  date  in  com¬ 
puting  these  coefficients  even  in  cases  where  this  obstacle  is 
not  present.  Theorem  6  gives  some  information  about  them.  For 
instance,  when  c  is  a  pole  the  standard  formula  for  the  radius 


(34) 


of  convergence  yields 
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n _ _ 

»up^  |r<n)  (x)  \M  " 

for  0  i  <  <1.  Mora over  the  theorem  specifies  the  sign  of 
(x!  for  *11  n  *  0. 

To  state  Theorem  6,  son'*  new  notation  will  be  needed.  The 
function  A  is  defined  by 

A(y)  -  tt2(  1  -  (l-<?2)y)  -  7^(1  -  (l-91)y)  (35) 

for  y  *  1.  E  is  the  set  of  all  points  that  map  into  c  under 
repeated  application  of  g^  and  g2>  i.e. 

E  ■  Ig”1^?1  (•  •  •  (gT1  (c) ) ) )  i  nil  and  i  «  1  or  2  for  all  lsjfn)  U(c). 

n  n-1  X1  3 

(36) 

For  any  real  y,  [y]  is  the  smallest  integer  greater  than  y. 

Finally,  we  note  from  (5)  that  U  may  be  regardod  as  an  operator 
on  complex  valued  functions  of  a  complex  variable.  ’.4e  shall 
sometimes  so  regard  it  below. 

!0&p,rejn  6.  Sgpjapae  «2»2  >  0^  and.  1  >  9^,9  ^ 
a.  JL£  *2  >  ^  then  y  can  be  continued  analytically  through¬ 
out  C  -  B.  The  point  c  is  j  pole  of  order  1  and  every  other  point 
Oj  E  Am  Mi  th MI  *  JQla  PA  order  1  or  *  regular  point  Of  7 .  roj£»n 
n  *  1  and  0  *  p  *  1,  ?*n*(p)  >  0. 

b*  II  * 2  “  rl  7  -SIS  J2£  CQhtlnued  analytically  oyer  the 
tntlrt  complex  plena-  IsiJ^  "  uiaj  o  i  P  n, 


0  <  y 


(37) 


(n)  !p)  *  n!  2"  nn("-l)/2. 


jSid-n 3) 


c.  U  T  <  ir  iLsa  there  is  a_  unique  x  >  1  for  wmch 


where  q  *  1  -  rain  (-*  , -^  • 
c. 

A (x)  *  0. 


x.  If  x  is  an  integer,  then  7  is.  i  polynomial  of 
(n ) 

degree  x,  ar^.  y  p)  >  0  fog.  aJJ.  1  «  n  s  x  and  0  s  p  *  1. 

ii-  ix  x  u  n&i  «ji  i as&a&i.  then  y  fe s.  eaaUnufld 
analytically  throughout  C  -  E.  The  point  c  Is  a  pole  of  order  1, 
and  every  point  of  P  is  either  a.  pole  of  order  1  o£  a.  regular 
point  of  y.  For  any  0  5  p  S  1,  7  ^  (p)  >  0  ii  1  <  n  *  [x]  or  if 
n-[x]  is  a  positive  even  integer  ■  and  7  ^  (p)  <  0  it  n-[x]  ij.  a. 
positive  odd  integer. 

Hl£.  fcun.sti9.nai  equation  (8)  faMa  isjL  fill  fifc  jJtt.  oxtensiona 
described  above . 

The  proof  of  Theorem  6  is  fairly  routine,  but  rather 
intricate.  Limitations  of  apece  prevent  inclusion  of  more 
than  the  following  brief  akatch. 

Sketch  of  proof.  An  inequality  ie  obtained  relating  the 
quantities  |  (u"V)  ^  |  for  a  smooth  function  +.  When  #(0)  ■  0 
end  *(1)  -  1  thia  inequality  yialda  a  bound  for  |y^|  on 
takinq  the  limit  as  m  -*  •.  Whan  v  v  thia  bound  la  the 
expression  on  the  right  aide  of  (37),  and  it  follows  that 
7  is  entire.  When  x  y  this  bound  shows  that  7  can  be 
extended  to  a  function  analytic  in  an  oval  including  [0,l] 
and  having  c  on  its  boundary.  Tha  functional  aquation  (8) 
can  ba  rewrittan  w>  «  7  where 


30. 


‘  w*{j5)  *  ii  '4  (z)) w(s)  -*■  *( g^z))  (l-w(z)) 

and  w (s)  «  t ,  ;/(7r,a  +  ir\{l-s)).  Tha  functions  W0?  pr.v'd* 
a  sequence  ox  analytic  continuations  of  y  into  rations  whoso 
union  is  C  -  E.  From  tho  aquation  W-y(s)  »  7(a)  it  can  be 
seen  that  •:  is  at  worst  *  poie  of  order  1  of  7,  and  fro™  this 
it  follows  -hat  each  point  of  E  is  either  a  pole  of  order  1 
or  a  regvlsr  point.  If  c  lit  regular,  all  are  regular.  To 
prove  tho  assertion  about  tbs  signs  of  the  derivative*  of  y , 
we  consider,  for  each  hypothesis  about  tha  parameters  of  the 
model,  the  class  of  nonnegstive  functions  having  derivatives 
with  the  pattern  of  signs  indicated  by  tha  theorem,  with, 
howevei .  strict  inequalitiaa  replaced  by  weak  inequalities. 

It  is  ^hown  that  U  preserves  this  class.  Since  I  :>e  longs  to 
thin  c >ao  and  ‘a  (p)  -»  7^  (p)  a*  m  -♦  •*,  7  o© )  :nga  to 

thin  c ..  »sl  bIpo.  A  supplesentary  argument  ubing  ch  .*  malyticitv 
of  7  •  .a)  is  strict  inequalities.  Fir  -lxy,  the  m.  a*  results 
on  tr  sjors  of  the  derivatives  of  7  <nd  the  eitj-ety  of  7 
if  c  1  n  tegular  point  are  shown  tc  preclude  r<  ularity  of 
c  in  ispp  (a)  and  (cii) .  Q.K.D. 


Surveying  the  cases  treated  in  Theorem  6  ^  see  that  in  all 


of  them  y~{ p)  >  0  for  0  *  p  t  1,  This  implies  that  if  p,  has  a 
distribution  F  with  mean  p.  and  positive  variance,  and  if  corres¬ 
ponding  probabilities  and  expectations  are  indicated  by  subscript 
F's,  then 

P  (lim  inf  A  )-  5  [7(p  )] 

F  i,n  r  l 

*  ?(u)  +  min  >  °(p)E_[  (p.-iO  ‘1/2 
0*p£l 

>  7  (u)  -  P  (lim  inf  A  ) . 

M-  l#n 

Thus,  if  the  distribution  of  p^  over  a  group  of  aninals  has 
mean  1/2  and  positive  variance,  'he  proportion  absorbed  on  the 
unfavorable  side  will  tend  to  exceed  the  corresponding  proportion 
for  rero  variance. ^"since  ell  of  the  etet  rate  for  the  .75  group 
of  Experiment  1  of  WeinstocX,  §j».  (1965)  had  p^  ■  1/2,  end  since 

there  ie  no  reeson  to  believe  that  this  condition  was  eat  by  all 
of  the  reel  rats,  the  above  result  may  help  to  explain  why  a  few 
more  real  rate  than  etet  rets  *»re  absorbed  on  the  unfavorable 


side. 


u. 
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1.  Examples  of  measurement  theory 


1.1  Definition  of  measurement 

Measurement,  in  its  broadest  sense,  consists  of  the  correspondence 
between  mathematical  objects,  such  as  ,cal  numbers,  vectors,  or  operators, 
and  empirical  objects,  such  as  heavy  bodies,  forces,  colors,  etc.  The 
correspondence  is  based  on  an  isomorphism  between  observable  formal  pro¬ 
perties  of  the  empirical  objects  and  the  forma!  properties  characterizing 
the  mathematical  objects.  For  example,  in  the  measurement  of  mass,  posi¬ 
tive  real  numbers  are  assigned  to  heavy  objects,  so  that  the  order  of  the 
numbers  reflects  the  order  of  the  objects,  as  determined  by  a  suitable 
balance,  and  the  addition  of  real  numbers  corresponds  to  combining  cf 
objects 

Many  instances  of  measurement  arc  like  the  measurement  of  mass,  inso¬ 
far  as  they  involve  construction  of  a  real-valued  function  that  preserves 
the  order  and  additive  structure  of  an  empirical  system.  Such  construc¬ 
tions  are  based  ultimately  on  the  theorem,  due  to  Holder  [13].  that  any 
Archimedean  fully  ordered  group  is  isomorphic  to  a  subgroup  of  the  ordered 
group  of  additive  real  numbers.  I  shall  present  a  formal  statement  and 
proof  of  this  theorem  in  the  next  section.  Following  this,  1  shall  present 
two  applications  of  Holder's  theorem.  In  the  first  application,  the  addi¬ 
tive  structure  in  the  empiricsl  objects  is  given  directly,  similar  to  com¬ 
bining  heavy  objects  in  the  same  pan  of  a  balance.  This  is  called  exttn- 
aive  In  the  second  application,  no  additive  structure  is  given 

directly,  but  nevertheless,  an  associative  binary  operation  can  be  defined, 
and  Holder's  theorem  applied. 


1.2  Holder’s  Theorem 


-  2  - 


DEFINITION  1.1  Let  G  be  a  group,  with  binary  operation  (x,y)  -*•  xy  and 
identity  e,  and  let  >  be  a  total  order  on  G.  The  pair  (G,_^)  is  called  an 
ordered  group  if  for  all  x,y,z  e  G,  x  >_  y  implies  both  xz  >  ys  and 
zx  >_  zy.  The  ordered  group  (G , >_)  is  called  Archimedean  if  for  all 
x,y  e  G,  with  x  >  e,  there  exists  some  positive  integer  n  such  that 
xn  >  y. 

We  shall  denote  the  ordered  additive  group  of  real  numbers  by  (Re, ♦,>). 

THEOREM  1.1  Let  (G,>)  he  an  Archimedean  ordered  group.  Then  (G,>J)  ie 
isomorphic  to  a  subgroup  of  (Re.+.i)*  Moreover,  the  isomorphism  is  unique 
up  to  multiplication  by  a  positive  constant . 

Procf:  Let  G+  *  {x|x  >  e).  We  can  distinguish  2  cases: 

A)  G+  has  a  lower  bound  x ^  >  e; 

B)  inf  G*  «  e.  (G  ■  {e >  is  a  trivial  case.) 

In  case  A),  for  any  y  c  G,  there  exists  a  unique  integer  n  (positive, 
negative,  or  zero)  such  that  x ”  £  y  <  x^+*.  If  y  +  x”,  then  Xj°y 
is  in  G*  but  is  <  Xj,  a  contradiction.  Thus,  y  *  x".  Hence,  G  is 
cycle  with  generator  x^,  and  the  theorem  follows;  the  subgroup  of  (Re,*,>) 
is  any  discrete  subgroup. 

For  case  B) ,  let  x  e  G*  and  y  e  G  be  arbitrary;  then  there  exists 
a  unique  integer  N(x,y)  such  that  xN^x,y^  <y  <  xN^X,y^41.  Clearly,  for 
x.x'  c  G+,  y  e  G,  we  have 
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[N  (x ,  x ' )  4  l][N(x',y)  4  1]  >  N(x,y)  >  N(x,x')N(x'  ,y) . 


(1) 


Let  { x^ }  be  any  sequence  in  G  which  converges  to  e.  It  is  easy  to  show 

+  - 1  4 

that  for  any  y  e  G  ,  N(xk,y)  -*■  4*,  while  for  y  e  G  ,  N(x^,y)  ■+ 

For  any  y,z  e  G,  with  z  f  e  and  for  k,£  sufficiently  large,  we  have 
by  (1), 


N(xk,y)  [N(xk,x^)  4  1] [N(x^,y)  4  1] 
N(xk,zT  <  N(xk,x^)N(x^,z) 


(2) 


If  we  fix  t  and  let  k  ■*  =»,  taking  the  limsup  on  the  left  and  right  in 
(2) ,  we  obtain 


N(xk,y)  N(x^,y)  4  1 

1J^up  NT^TzT  -  N(x£,z) 


(3) 


Now  taking  the  liminf  on  the  right  in  (3),  as  £-*■<»,  we  find  that 

lim  N(xk,y)/N(xk,z)  exists  (and  is  finite,  as  is  easily  seer').  For  fixed 

y.  t  G* ,  define 

N(xk.y) 

$(y)  =  li™  (ttj — r-y  • 

k-»-  Nlxk,yiJ 


It  is  easily  shown  that  $  is  an  isomorphism  of  (G,>)  onto  a  subgroup  of 
(Re, +, >) .  To  this  end,  one  can  use  the  fact  that  for  any  x  e  G  ,  y,z  c  G, 
N(x,y)  4  N(x,z)  ♦  l  >  N(x,yz)  ^  N(x,y)  4  N(x,z).  (4) 

To  show  uniqueness  of  the  isomorphism,  let  4*  be  any  other  isomorphism; 
then  clearly,  for  any  k,y 

N(xk,y)$’  (xk)  <_  ♦  ’  (y)  <  (N(xk,y)  4  1 )  ♦*  (xk)  - 

It  follows  that 


lim 

k-*«> 


N(xk,yl 

N(xk,y‘p 


4(y)  , 


-  4  - 


so  that  ♦'  ■  a+,  where  a  «  ♦* (y^D  >  0.  This  completes  the  proof 
of  Theorea  1.1. 

For  a  different  proof,  see  Birkhoff  [3],  p.  300.  The  proof  given  above 
has  the  advantages  of  being  easily  generalized,  and  of  constructing  the 
isoaorphisa  4  in  a  Banner  siailar  to  actual  aeasureaent  procedures.  These 
points  will  be  aade  aore  clearly  in  sections  (1.3)  and  (2.2). 

1.3  Extensive  aeasureaent 

In  extensive  aeasureaent,  one  starts  with  an  eapirical  systea  that 
includes  an  associative  binaxy  operation.  Placing  2  heavy  objects 
together  in  the  saae  pan  of  a  balance  is  one  exnple;  others  are  found  in 
the  usual  aeasureaent  procedures  for  length,  where  rods  are  coabined  by 
laying  thea  end  to  end,  and  for  tiae,  where  tiae  Intervals  are  concatenated 
by  using  the  saae  event  to  aark  the  end  of  one  interval  and  the  beginning 
of  another. 

The  following  set  of  weak,  logically  independent  axioas  is  due  to 
Suppes  [  33] . 

Primitives:  K,  a  noneapty  set 

Q,  a  binary  relation  on  K 

*,  a  binary  function  on  K,  (x,y)  x*y. 

Axioms :  For  all  x,y,x  t  K 

1.  if  xQy  and  y<)x,  then  xQz 

2.  x*y  c  K 

3.  (x*y)«x  Q  x*(y*x) 
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4. 

:  f 

xQy,  then 

x*z  Q  z*y 

S. 

if  not  xQy, 

then  there  exists  w  c  K  such 

that 

x  Q 

y  *w 

and 

y*w  Q  x 

6. 

not 

x*y  Q  x 

7. 

if 

xQy,  then 

there  is  a  positive  integer  n 

such 

that 

y  Q  nx 

[lx 

*  x,  nx  « 

(n  -  l)x  *  xl . 

We  can  prove  the  following  measurement  theorem. 

THEOREM  1.2  If  (K,Q,*)  satisfies  Axioms  1-7,  then  there  exists  a  real- 
valued  function  $  on  K  such  that  for  all  x,y  e  K 
(i)  xQy  if  and  only  if  *(x)  <_  *(y) 

(ii)  4(x*y)  -  ♦(x)  ♦  ♦(y). 

Furthermore,  i  is  unique  up  to  multiplication  by  a  positive  constant. 

Theorem  1.2  includes  a  representation  theorem  for  extensive  measurement 
a  theorem  specifying  that  real-valued  assignments  can  be  constructed  that 
preserve  the  empirically  given  structure- -and  a  uniqueness  theorem,  limit  in 
the  class  of  possible  representations.  Uniqueness  theorems  are  quite  imp  ; 
tant  in  measurement,  since  they  determine  what  sorts  of  statements  about 
measured  values  are  meaningful.  Heasuremcnt  representations  that  are  uniqw 
up  to  multiplication  by  a  positive  constant  are  called  ratio  socles,  becac 
ratios  are  preserved  by  permissible  changes  in  representation.  Thus,  the 
statement  "X  is  twice  as  tall  as  Y"  is  meaningful,  independent  of  the  un 
chosen  for  measurement  of  length. 


To  prove  Theorem  1.2,  it  is  convenient  to  introduce  a  relation  •  on  K: 
x  -  y  if  xQy  and  yQx.  From  the  axioms,  -  can  be  shown  to  be  an  equivalence 
relation.  Moieover,  it  can  be  shown  that  Q,*  induce  a  total  order,  >_,  and 
a  binary  operation,  ♦,  on  the  set  of  equivalence  classes,  K/~.  The  system 
**tisfi®s  *11  the  propeities  of  G*  in  Holder's  Theorem;  in  par¬ 
ticular,  although  inverses  do  not  exist,  K/-  is  closed  under  subtraction  of 
smaller  elements  from  larger  ones  (see  Axiom  5).  This  permits  the  proof  of 
Holder's  Theorem  to  be  carried  through,  with  no  change,  for  (K/-,+,V); 
this  latter  is  proved  isomorphic  to  a  subsemigroup  of  the  additive  semi¬ 
group  of  positive  real  mashers.  The  isomorphism  yields  the  measurement 
representation  required  in  Theorem  1.2.  Uniqueness  follows  similarly  from 
the  uniqueness  arguaent  for  Holder's  Theorem. 

Finally,  I  should  like  to  point  out  the  close  relation  between  the 
construction  of  the  isomorphism  in  Theorem  1.1  or  1.2,  and  actual  procedures 
for  assigning  real  numbers  to  objects.  Consider  the  case  where  K  consists 
of  straight  rods,  and  x*y  is  formed  by  laying  end-to-end  one  replica  of  rod 
x  and  one  of  rod  y.  Laying  rods  side-by-side  permits  comparisons,  esta¬ 
blishing  xQ> ,  etc.  Measurement  is  carried  out  by  forming  a  standard 
tsqutnem  x,  2x,  ....  nx  (we  use  additive  notation)  laying  1,  2,  ....  n 
replicas  of  x  end-to-end.  To  measure  y  in  feet,  we  form  the  ratio  of  N(x,y) 
to  M(x,yj),  where  is  a  standard  foot-ruler.  The  generator  of  the  standard 
sequence,  x,  is  chosen  sufficiently  small  to  attain  any  desired  accuracy  of 
meaaurement.  Equation  (4)  shows  that  the  approximate  measures,  N(x,y)/N(x ,y , 
are  approximately  additive.  The  mein  point  of  Theorem  1.1  was  to  show  that 
N(x ,y)/N(x,yj)  converges  es  x  is  taken  arbitrarily  small. 


1.4  Conjoint  measurement 


In  the  social  sciences,  it  is  rare  to  find  associative  binary  opera¬ 
tions  that  can  be  used  for  extensive  measurement.  However,  it  is  common  to 
observe  an  ordering  of  objects,  where  position  in  the  ordering  depends  on 
the  values  of  2  or  more  independently  controllable  factors.  Such  a  situation 

is  represented  formally  by  a  transitive  relation  >_  defined  over  a  product  set 
n 

A  =  n  A..  The  simplest  law  governing  the  dependence  of  ordinal  position 
i-1  1 

on  the  different  factors  is  an  additive  one: 

n 

*(a! . V  *  .1  w 

1*1 

where  *  is  a  real-valued  ,  order-preserving  function  on  A  and  each  $.  is  a 
real-valued  function  on  A..  If  such  functions  can  be  constructed,  we  say 
that  additive  oemjoirit  meaBurene'  t  is  feasible  for  the  system  •  A  ^ 

The  functions  and  ♦  provide  measurement  scales  for  the  factois  A.  and  the 
observed  output,  relative  to  which  an  additive  law  holds. 

Additive  conjoint  measurement  has  a  complex  history,  however,  it  was 
the  publication  of  a  set  of  sufficient  conditions,  by  I  net  and  Tukey  [24  [ 
in  1964,  that  created  widespread  interest.  The  most  important  anticipation 
of  their  work  was  published  by  Debrcu  (9  ).  The  luce-Tuko>  axioms,  which 
apply  to  the  case  A  ■  A,  «  A,,  are  essentially  the  following. 

fr>t  1 1  its :  Aj  ,  A^.  nonempty  sets 

>,  a  binary'  relation  on  V  •»  A.  *>  A,. 

Art ant ; 

1.  MS*  weak  order;  i.e.,  it  is  transitive  and  an>  2  lienor.;*.  .  •"  \ 

are  comparable. 


s 


2.  Any  change  in  one  factor  can  be  exactly  compensated  by  a  change 

in  the  othe-r;  i.e.,  if  a  c  A,  bj  e  Aj,  then  there  exists  b2  e  A^, 
such  that  a  -  (bj,b2)  C~  means  _>  and  <J ;  and  similarly  for  the 

other  factor. 

Axiom  2  is  called  the  f>ol ^ability  axiom,  since  we  "sol  e"  for  b2,  given 
*-bi- 

3.  For  any  Uj ,a?) , »b2) » (Cj,  -'2)  e  A,  if  (a1,b2)  _>  (bj,c2)  and 
(b,^)  >  (cltb2),  ti-  n  Uj.a^)  >_  (c^c^. 

Axiom  3  is  called  the  cancellation  axiom,  since,  given  an  additive  repre¬ 
sentation,  we  can  add  up  the  2  antecedent  inequalities  and  cancel  b  +  b„, 

1  L 

yielding  the  conclusion.  This  condition  is  illustrated  geometrically  in 
terms  of  indifference  curves  in  the  A^  x  A 2  plane,  in  Figure  1.  In  the 
theory  of  webs,  this  is  called  tne  Thomsen  condition  (see  Aczil,  Pickert, 
and  Rado  [1]) . 


Fig.  1.  The  cancellation  axiom  illustrated  for  indifference  curves.  If 
two  "flights  of  stairs"  are  inscribed  between  two  indifference 
curves,  as  shown,  then  alternate  intersections  lie  on  the  indif¬ 
ference  curve  when  the  cancellation  axiom  is  true.  (Taken  from 
Luce  5  Tukey  ([241,  p .  7 ) ) .  ("The  author  wishes  to  thank  Academic 
Press,  In.  fer  permission  to  reprint  this  figure  from  the 
Journal  of  Mathemat ica I  Psychology.) 


in  A  is  called  a 


.  o  . 


4.  A  sequence  [(a^.a^)  J i , j  *  0,  *1,  *2,  ...} 

dual  standard  sequence  if  (a^.a^)  -  ( aiJc,a2£-*  i  ♦  j  *  k  ♦  t. 

If  {(a^.a^)}  is  a  dual  standard  sequence,  then  for  any  a  e  A, 
there  exist  n.m  with  (a,  ,a,  )  >  a  >  (a,  ,a«  ). 

Axiom  4  is  called  the  Archimedean  axiom.  It  is  easily  verified  thet  Axioms 
1,  3,  «*nd  4  are  necessary  for  additive  conjoint  measurement;  solvability 
is  not. 


THfcOSF.M  1.3  If  (Aj, A^, >)  satisfy  Axioms  1-4,  then  there  exist  real-valued 
flmct-'oru3  *  on  A,  ^  on  $2  on  Aj.  such  that  for  all  (aj  ,a2) ,  (^  ,b2)  e  A 

(ii  (a^a^  _>  (bltb2)  if  and  only  if  ^iKbj.b^ 

(ii)  Hsl,a2)  =  *1(a1)  ♦  $2(a2). 

Furthermore ,  if  <J>',  are  any  other  such  functions ,  then  there  are 

real  numbers  a  >  0,  3^,  32  such  that  ♦  3^,  $'  ■*  a$  ♦  Bj  ♦  82- 

It  should  be  noted  that  the  uniqueness  clause  of  this  theorem  is  the 
best  that  could  be  expected.  Such  a  representation  is  called  interval  scale 
measurement;  ratios  of  intervals  are  invariant  under  permissible  transfor¬ 
mations.  (A  more  standard  term  would  be  affine  scale,  since  the  affine  ratio 
is  invariant.) 

I  shall  sketch  a  proof  of  Theorem  1.3,  based  on  Holder's  Theorem,  which 
was  published  by  Krantz  [17], 

Choose  an  arbitrary  origin  a^  ®  (aj,a2)  in  A.  By  solvability,  any 
equivalence  class  of  A  contains  elements  of  forms  (bj,a2)  and  (aj,b2). 
Define  an  operation,  ♦,  on  A/-,  by 
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(bj.a®)  ♦  (aj,b2)  «  (bj.bO. 

I  shall  show  that  v  is  well-definsd  and  that  is  an  Archimedean 

ordered  group,  where  >_  is  defined  in  the  natural  way  on  A/~. 

If  (bj.a^)  ~  (aj,c2)  and  (a^,b2)  -  (c^.aj),  then  by  cancellation, 
(bj,b2)  -  (Cj,c2).  It  follows  that  +  is  well-defined  and  comutative , 
since  the  equivalence  class  determined  by  adding  arbitrary  representations 
of  the  form  (bj,a2),  (aj,b2)  is  the  same  as  that  determined  by  adding 
arbitrary  representations  in  the  reverse  order. 

To  prove  associativity  represent  3  arbitrary  equivalence  classes  as 
(bj,a2),  (a^,b2),  (c^.a^).  By  solvability  find  c2»d2  e  A2  such  that 
(bj,b2)  -  (aj,c2)  and  (Cj,b2)  -  (a^,d2).  By  definition  of  ♦,  cancellation, 
and  commutativity, 

[(bj.a^)  ♦  (aj,b2)l  ♦  (c^a®)  «  (Cj,c2) 

*  (blfd2) 

«  (bj  a®)  ♦  [(aj,b2)  ♦  (Cj.aJ)]. 

Obviously,  the  equivalence  class  of  a^  is  the  identity,  and  if  (bj.bp  '  a°, 
0  0 

then  (a^.bj)  and  (h^,a2)  are  inverse.  Hence,  (A/~,  +  )  is  a  commutative 
group.  Note  that  the  results  so  far  use  only  properties  of  '. 

If  (bj.aj)  2l  (Cj,a2),  and  (a^.d^t  is  arbitrary,  find  e  A^ 
with  (Cj.aj)  ~  (dj,d2).  By  cancellation,  applied  to  (bj.a^)^.  (dj,d2), 
(dj.dj)  '  (Cj,a2) ,  we  obtain  (bj,d2)  >_  (Cj,d2),  or 
(bj,a2)  ♦  (a1,d2)  _>  (Cj,a.,)  ♦  (»1,d2). 

Thus,  (A/~,*,>J  is  an  ordered  group.  Finally,  the  Archimedean  property 
follows  easily  from  Axiom  4. 
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by  Holder's  Theorem,  there  is  an  isomorphism  $  of  (A/-,*„>)  onto 
a  subgroup  of  (Re,  +  ,>J.  Let  ^(bj)  ■  ♦(bj.a®),  $2(b2)  *  ♦(•j.b,) .  Then 

()i(b1,b2)  ■  ^(bj)  ♦  as  sPeci^ied  by  Theorem  1.3,  The  uniqueness 

clause  follows  from  the  fact  that,  if  $',  are  any  functions  satis¬ 

fying  (i)  and  (ii)  of  Theorem  1.3,  then 

(b1,b2)  -  ♦•(b1)  -  ♦J(aj)  ♦  ^’Cb2)  -  ^(a2) 
is  an  isomoiphism  of  (A/~,+,^)  into  (Re,*,>),  and  so,  by  Holder's  Theorem 
must  differ  from  $  by  multiplication  by  a  positive  constant.  This  completes 
the  proof. 

Construction  of  the  isomorphism  $,  and  thus  of  the  measurement  scales 
$.  ♦j,  4*2 •  depends  on  the  construction  of  a  standard  sequence  in  (A/-, ♦,_>), 
as  in  the  proofs  of  Theorems  1.1  and  1.2.  This  amounts  to  measuring  the 
deviation  of  any  (bj.bp  from  (aj.a^)  in  terms  of  multiples  of  a  small 
unit  deviation  from  (aj,a2). 
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2.  Some  new  problems  generated  by  applying  theories  of  measurement 

in  the  social  sciences. 

2.1  fhe  role  of  measurement  theory  in  social  science 

For  the  physical  scientist,  measurement  theory  is  properly  a  branch 
of  philosophy.  The  axioms  for  extensive  measurement  of  mass,  length, 
or  time  provide  a  foundational  analysis  of  long-established  procedures. 
However,  these  axioms  are  too  trivial  to  claim  the  status  of  laws  of 
physics;  rather,  they  are  obvious  properties  of  measurement  operations, 
and  are  taken  for  granted  in  the  actual  practice  of  measurement. 

Furthermore,  in  testing  nontrivial  laws  that  specify  rules  of  com¬ 
bination  for  2  or  more  variables,  the  physicist  need  not  rely  on  an 
axiomatic  analysis  of  the  sort  provided  by  conjoint  measurement  theory. 

For  example,  the  equation  of  state  for  an  ideal  gas,  pV/T  »  constant, 
and  the  second  law  of  motion,  F  *  ma,  are  stated  in  terms  of  numerical 
scales  obtained  by  extensive  measurement,  and  are  directly  testable  by 
numerical  calculations. 

In  the  social  sciences,  there  are  no  measurement  procedures  comparable 
to  the  ones  used  for  measurement  of  mass,  length,  and  time.  Therefore, 
when  an  axiomatic  theory  of  measurement  is  applied  in  a  social  science 
context,  the  axioms  are  not  obvious  properties  of  long-established  pro¬ 
cedures;  rather,  they  are  a  set  of  proposed  laws,  which  are  not  at  all 
trivial.  Some  of  the  laws  may  be  qualitative,  i.e.,  directly  testable 
by  observations  involving  order  or  class  membership.  The  axioms  of  addi¬ 
tive  conjoint  measurement  are  of  this  sort.  Other  laws  may  be  numerical. 
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for  example,  the  assertion  that  2  variables  combine  additively.  These 
numerical  laws  cannot  be  tested  as  simply  as  in  physics,  since  nc< 
numerical  scales  are  specified  for  the  variables.  Rather,  they  roust 
be  tested  by  searching  for  numerical  scales  that  satisfy  the  laws,  or  by 
testing  other  laws  that  imply  or  are  implied  by  the  given  numerical  laws. 

In  short,  at  the  present  stage  of  development  of  quantitative  theory  in 
social  science,  it  is  impossible  to  separate  the  search  for  interesting 
empirical  laws  from  the  discovery  and  refinement  of  measurement  procedures. 

\s  a  consequence  of  the  above  situation,  measurement  theory  is  of 
more  than  philosophical  interest  for  social  science.  By  providing  an 
axiomatic  theory'  for  various  numerical  laws,  one  proposes  qualitative 
experiments  that  distinguish  between  laws,  and  techniques  of  measurement 
where  none  existed  previously.  One  result  of  the  more  dynamic  and  inte¬ 
gral  role  of  measurement  theory  in  social  science  is  that  the  discoveries  or 
the  difficulties  encountered  in  empirical  studies  constitute  an  important 
source  of  new  mathematical  problems. 

The  next  5  sections  are  devoted  to  an  overview  of  5  areas  in  which 
new  mathematical  problems  have  emerged  from  the  requirements  of  social 
science  quantification:  foundations  of  geometry,  ordered  rings,  theory 
of  models,  semiorders,  and  error  theory.  The  problems  in  foundations  of 
geometry  and  in  ordered  rings  wen*  generated  by  the  attempt  to  axiomatire 
laws  other  than  the  simple  additive  combination  of  variables:  geometric 
laws,  and  polynomial  combination  laws,  respectively.  These  topics  will  be 
pursued  in  more  depth  in  lectures  3  and  4.  The  problems  in  theory  of 
models,  semiorders,  and  error  theory  derive  from  difficulties  in 
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realizing  idealized  primitives  of  measurement  theory,  such  as  total 
orderings,  amid  the  doubts  and  errors  of  real  data. 

2.2  Foundations  of  geometry 

Geometrical  models  are  heavily  used  in  social  science  as  a  basis 
for  quantitative  treatments  of  similarity  or  correlation.  For  example, 
suppose  that  one  has  a  set  of  objects,  A,  and  obtains  some  measure  of 
the  dissimilarity  of  any  2  objects  in  A.  This  measure  gives  rise  to 
an  order  relation  on  A  x  A.  To  represent  the  dissimilarity  ordering 
by  a  geometric  model,  one  tries  to  map  the  objects  of  A  into  a  metric 
space,  where  the  ordering  of  metric  distances  corresponds  to  the  observed 
ordering  of  dissimilarities.  In  1962,  Shepard  [31]  published  a  practi¬ 
cal  method  of  computing  a  representation  for  a  finite  set  A,  in  low- 
dimensional  Euclidean  space,  which  yields  the  best  approximation  (for  a 
given  dimension)  to  the  dissimilarity  ordering  on  A  *  A.  Since  then, 
this  sort  of  measurement  has  been  widely  practiced,  with  little  concern 
over  appropriate  foundations. 

In  terms  of  measurement  theory,  the  problem  of  foundations  may  be 
stated  as  follows:  given  a  set  A,  an  observable  ordering  >_  of  the  pairs 
of  elements  of  A,  and  a  class  C  of  metric  spaces  (the  desired  geometric 
representation),  what  axioms  (empirical  laws)  must  be  satisfied,  in  order 
for  there  to  be  a  metric  d  on  A,  such  that  (A,d)  is  in  class  C,  and  such 
that  (x,y)  (z,w)  it  and  only  if  d(x,y)  >_  d(z,w)?  From  the  viewpoint 

of  the  classical  field  of  foundations  of  geometry,  we  are  asking  for  an 
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axiomatization  of  geometries  of  class  C,  in  terms  of  the  undefined 
(primitive)  notions  of  a  set  of  points  and  a  quaternary  relation  on 
the  points. 

The  classical  axiom  systems  for  foundations  oi  Euclidean  geometry 
(see  Blumenthal  [4])  generally  involve  undefined  notions  of  point , 
congruence  of  point  pairs  (a  quaternary  relation),  and  collinear  between¬ 
ness  of  point  triples  (a  ternary  relation).  Sometimes,  lines,  and 
incidence  of  points  and  lines  are  also  taken  as  primitive.  In  the  study 
of  empirical  similarity,  the  required  primitives  (incidence  of  a  point 
on  a  line,  or  collinear  betweenness)  do  not  seem  to  arise  in  any  natural 
way.  Thus,  the  problem  of  developing  geometric  measurement  theories  for 
similarity  generates  new  problems  in  foundations  of  geometry,  that  is, 
axiomatizing  different  forms  of  metric  geometry  in  terms  of  a  single 
quaternary  relation.  One  such  axiomatization  will  be  presented  in  detail 
in  lecture  3,  and  some  further  possibilities  will  be  mentioned  briefly 
in  lecture  4. 

2.3  Ordered  rings 

Another  source  of  problems  is  found  in  the  gvreral  theory  of  conjoint 

measurement.  Given  a  set  of  factors.  A.,  ....  A  ,  and  an  order  relation 
n  1  n 

>  on  A  *  n  A,,  various  laws  of  combination  for  the  different  factors  can 
i-1  x 

be  considered,  besides  the  additive  law  discussed  in  lecture  1.  The  following 
definition  is  quite  general. 
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DEFINITION  2.1  Let  \  b®  nonempty  sets,  with  >  a  binary  relation 

n 

on  A  •  JI  A..  Let  f  be  a  real-valued  function  of  n  real  variables. 
i-1  1 

We  say  that  (A^,  A^,^)  is  decomposable  relative  to  f  if  there  exist 

real-valued  functions  ♦  ,  ,  ....  *  ,  with  t  defined  on  A  and  on  A., 

such  that  for  a  ■  (aj,  ....  an),  b  ■  (bj,  ....  bn)  c  A, 

(i)  a  >_  b  if  and  only  if  ♦(a)  >.  4(b) 

(ii)  +(«)  -  fl^faj),  ....  C  an )  ] . 

The  function  f  gives  the  rule  of  combination  for  the  variables;  $, 

♦j,  $n  give  appropriate  measurement  scales.  The  problem  of  measure¬ 

ment  theory  is  to  specify  axioms  (empirical  laws)  that  are  necessary 
and/or  sufficient  for  decomposability  relative  to  a  specified  rule  f. 

This  problem  becomes  fairly  tractable  when  the  function  f  is  a  polynomial 
in  n  variables.  Moreover,  quite  a  few  miniature  theories  have  been  pro¬ 
posed,  which  explicitly  posit  polynomial  rules  of  combination  for  a  set 
of  factors. 

For  one  simple  illustration  of  polynomial  combinations  rules,  consider 
the  relationship  of  the  evaluative  (moral)  connotation  of  combinations  of 
quantitative  adverbs  with  adjectives,  as  a  function  of  the  adverb  and 
of  the  adjective.  The  overall  moral  connotation  of  a  combination  such  as 
"slightly  evil"  is  better  described  as  a  multiplicative,  rather  than  an 
additive  combination  of  "slightly"  and  "evil".  To  see  this,  note  that 
"slightly  evil"  would  be  raced  hotter  than  "very  evil",  while  "slightly 
pleasant"  would  be  rated  worse  than  "very  pleasant".  These  opposite 
orderings  of  "slightly"  and  "very"  correspond  to  multiplying  nimierical 
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scale  values  of  the  modifiers  by  moral  values  of  opposite  sign  for 
"evil"  and  "pleasant".  Studies  of  moral  connotation  that  ■’elude 
the  above  examples,  and  use  a  multiplicative  combination  rule,  were 
carried  out  by  Cliff  [  S  j.  Many  other  miniature  theories  involve 
mixtures  of  additive  and  multipli  itive  combinations,  i.e.,  more 
general  polynomials. 

The  basic  tool  for  polynomial  onjoint  measurement  is  the  ring  analog 
of  Holder's  theorem:  an  ArchimeJ  in  ordered  ring  (with  nontrivial  multi* 
plication)  is  isomorphic  to  a  unique  subring  of  the  ordered  ring  of 
real  numbers.  (See  Birkhoff  {  3],  p.  398).  This  tool  can  be  used  in 
at  least  2  ways.  One  procedure  is  analogous  to  the  proof  of  Theorem 
1 . 3  on  additive  conjoint  measurement:  one  introduces  ring  operations, 

♦,  •  directly  into  the  set  of  equivalence  classes,  A/-.  The  definitions 
of  +  and  •  depenJ  on  the  hypothesized  polynomial;  the  required  axioms 
are  those  for  which  'he  system  .A'-,  +,  • ,  *J  becomes  an  Archimedean 
ordered  ring.  A  different  strategy  is  to  let  each  relation  statement  of 
form  ■  1  b  correspond  to  a  suitable  polynomial  inequality.  Obtaining 
the  required  functions  is  equivalent  to  solving  a  set  of  simultaneous 
polynomial  inequalities.  This  leads  to  the  study  of  partial  orders  on 
polynomial  rings.  In  particular,  the  following  question  seems  to  be  un¬ 
solved  and  of  interest:  for  what  classes  of  partially  ordered  rings  is 
an  extension  possible  to  an  Archimedean  total  order?  This  problem  is 
discussed  by  Tverskv  [3:.j;  for  soeie  results  on  extensions  of  partial 
orders ,  see  Fuchs  j  i  1 J  . 
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2.4  Factorial  designs;  theory  of  models 

n 

(liven  a  binary  relation  ^  on  IT  A.  ,  a  common  experimental  procedure 

i“l  1 

is  to  sample  a  finite  subset  B,  of  A. ,  i  *  1,  ...,  n,  and  to  observe 

n 

the  ordering  only  on  n  B. .  This  is  called  a  factorial  design.  Certain 

i  *1  1 

axioms  of  polynomial  conjoint  measurement  theories,  such  as  solvability  and 
Archimedean  axioms,  are  untest  able  in  such  an  experiment.  But  even  if 
the  untested  axioms  are  valid  in  the  entire  empirical  system 
(Aj ,  ....  A  ,>J  ,  while  the  testable  axioms  are  verified  in  (B^,  ..., 
it  may  still  be  false  that  ( B ^ ,  ...,  is  decomposable  relative  to 

the  polynomial  combination  rule  in  question.  Testing  this  decompos ability 
amounts  to  searching  for  a  simultaneous  solution  to  a  finite  set  of 
polynomial  inequalities,  a  problem  which  is  computationally  demanding 
and  for  which  there  seems  to  be  no  general  algorithm. 

Thus,  the  problem  arises  of  ariomatizing  polynomial  combination  rules 
for  finite  systems.  Here,  the  theory  of  models,  developed  by  Tarski  [34], 
is  relevant.  Using  results  from  this  theory,  Scott  and  Suppes  [ 30}  proved 
a  theorem  that  implies  that  there  is  no  finite  axiomatiiation  for  finite 
system*  of  additive  conjoint  measurement,  by  universal  sentences  in  the 
first  order  functional  calculus.  One  may  conjecture  that  there  is  no  finite 
axiomatiiation,  in  first-order  factional  calculus,  for  any  system  of 
polynomial  conjoint  me as u remen t . 
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2.5  Semiorders 

The  binary  or  quaternary  relations  of  extensive,  conjoint,  or  metric 
space  measurement  are  generally  assumed  to  be  transitive.  In  practice, 

2  types  of  intransitivity  are  observed: 

(i)  x  ■  y,  y  '  z,  but  x  >  i, 

(ii)  x  >  y,  y  >  z,  but  z  >  x. 

The  first  type  may  occur  because  differences  between  x  and  y  and  between 
y  and  z  are  too  small  to  be  detected,  but  add  up  to  a  detectable 
difference  between  x  and  z. 

Type  (i)  intransitivities  occur  in  a  formal  system  called  a  eemi- 
order ,  introduced  by  Luce  (21j.  ^his  involves  2  binary  relations,  P 
(strict  preference)  and  I  (intransitive  indifferer.ee). 

Let  juxtaposition  denote  the  usual  relation  product,  i.e.,  x  PI  i 
if  there  exists  y  such  that  xPy  and  viz;  let  P*  be  the  reflection  of 

P  in  the  diagonal,  i.e.,  x  P*  y  if  ypx.  We  can  define  a  seaiorder 

as  follows. 

DEFINITION  2.2  (X.P.I)  is  a  $«mcrjer  if  X  is  a  set,  and  P ,  I  are  binary 

relations  or.  X,  such  that 

1.  {P,P*,I >  is  a  partition  of  X  *  X 

2.  PIP  c  P 

3.  P^  0  I  *  is  empty . 
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x  *-y  y  » 


Fig.  2.  Illustration  of  axioms  2  (left)  and  3  (right)  for  a 
semiorder.  The  configuration  on  the  left  implies 
xPw.  The  configuration  on  the  right  is  asserted  to  be 
impossible. 


The  content  of  conditions  2  and  3  is  depicted  in  Figure  2.  In  the 

left  diagram,  xPy,  ylz,  zPw,  and  the  conclusion  is  xPw •  In  the 

2  2 

right  hand  diagram,  x  P  z  (via  y)  and  x  1  z  (via  w) ;  the  asser¬ 
tion  is  that  no  such  configuration  exists. 

One  of  the  main  results  on  semiorders  is  that  any  semiorder  defines  a 
natural  complete  order. 


THEOREM  2.1  Let  (X,P,I)  be  a  semiorder .  Define  x  -  y  if  for  all 
z  e  X,  xlz  iff  ylz,  &<d  x  >_  y  if  neither  y  PI  x  nor  y  IP  x. 
Then  ~  is  an  eouivalence  relation  and  >_  induces  a  total  order  on  the 
equivalence  classes  X/~. 


The  proof  of  this  theorem  is  a  useful  exercise. 

Several  problems  arise  in  connection  with  semiorders.  One  problem  is 
to  axiomatize  various  forms  of  measurement,  replacing  the  usual  order  rela¬ 
tion  by  a  semiorder.  This  can  be  done  in  a  trivial  way  using  the  defined 
total  order  of  Theorem  2.1,  but  the  real  point  is  to  show  that,  in  a 
semiordered  syste.a,  one  can  attain  any  desired  accuracy  of  measurement  from 
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appropriate  finite  sets  of  P,I  observations.  This  has  been  done  for 
extensive  measurement  by  Krantz  [18], 

A  second  type  of  problem  is  to  dcai  with  type  (ii)  intransitivities. 

One  way  to  account  for  these  is  by  assuming  shifts  in  dimensions  that 
determine  the  decision.  For  example,  in  purchasing  a  new  car,  each 
additional  accessory  may  seem  worth  the  added  cost,  but  the  total  cost 
of  several  may  drive  one  back  to  the  basic  model.  Below  same  threshold, 
the  cost  dimension  is  ignored;  above,  it  is  decisive.  One  sight  capture 
this  by  assuning  2  semiorders  (X,Pj,I .),  (X,P2,I2)  over  the  sans®  base  set 
X,  and  defining  the  "lexicographic  product”,  P  *  IJ  (I  ^  fl  , 

T  «  Ij  H  I2  That  is,  xPy  if  xP^y  (the  first  dimension  is  decisive) 
or  if  (1  P2)y.  Obviously,  P  need  not  be  transitive.  The  interesting 

question  is  to  characterize  lexicographic  products  of  semiorders:  given 
a  pair  of  relations,  P,I,  what  properties  guarantee  the  existent j  of 

Pi , 1 1 ,  such  that  and  (X*P2»I2^  are  semiorders  and  (PJ) 

is  their  lexicographic  product?  In  empirical  terms,  can  one  infer  the 
latent  dimensional  structure  from  a  pattern  of  intransitivities? 

2.6  Error  theory 

One  of  the  most  serious  bars  to  testing  the  axioms  of  various  measurement 
theories  is  the  presence  of  "random"  error.  One  way  to  deal  with  this 
difficulty  is  to  s  -perimpose  a  probability  model  on  the  algebr  ic  one. 

For  example,  in  conjoint  measurement,  one  might  assume  that  a  ^air  (a^.aj) 
corresponds  to  a  Gaussian  random  variable  with  expectation  M(alta2);  one 
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might  interpret  (a^a^)  _>  (b^bj)  to  mean  that  Mfaj^) 

Axioms  such  as  transitivity  or  cancellation  [1,  3  of  (1.4)]  are  testable 
statistical  hypotheses. 

A  criticism  of  the  conventional  statistical  approach  is  that,  if 
the  measurement  axioms  are  satisfied,  then  the  construction  of  measurement 
scales  induces  transformations  of  the  random  variables.  It  is  at  least 
as  reasonable  to  assume  that  the  transformed  random  variables,  rather  than 
the  original  ones,  satisfy  a  tractable  probabilistic  model,  but  this 
greatly  complicates  the  statistical  treatment. 

More  generally,  one  may  wish  to  deal  with  random  error  in  a  manner 
that  is  less  arbitrary  than  assumption  of  a  special  probabilistic  model. 

It  may  be  desirable  to  incorporate  error  processes  more  directly  into  the 
system  of  primitives  and  axioms. 

An  extreme  version  of  the  incorporation  of  error  processes  into  a 
measurement  axiomatization  is  to  base  the  measurement  entirely  on  error. 

For  example,  given  a  family  of  real-valued  random  variables,  one  may  seek 
a  transformation  of  the  real  numbers  such  that  the  transformed  random 
variables  are  identically  distributed  except  for  translations.  If  this 
can  be  done,  the  transformation  in  question  provides  a  measurement  scale 
that  regularizes  the  error  theory.  Levine  [20]  explored  this  problem 
quite  deeply;  among  other  results,  he  showed  that  if  such  a  transformation 
exists,  for  a  family  of  3  or  more  random  variables,  then  it  is  unique  up  to 
changes  of  origin  and  unit,  i.e.,  we  have  interval  scale  measurement. 

Thus,  the  error  theory  has  run  away  with  the  measurement  procedure--there 
is  no  longer  any  room  for  basing  measurement  on  an  extensive  operation, 


a  geometric  model,  or  a 
manner  of  incorporating 
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polynomial  combination  rule.  Some  intermediate 
random  error  in  the  measurement  process  would 


seem  desirable. 
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3.  Metrics  with  additive  segments 

3.1  Preview 

a  metric  space  (X,d)  is  a  metric  with  additive  eegmente  if  for  any 
x,z  e  X,  there  is  an  isometry  f  of  the  real  interval  [0,d(x,z)]  into 
X,  such  that  f(0)  *  x,  f(d(x,z))=  z.  Most  metric  spaces  studied  in 
geometry  are  of  this  type:  e.g.,  Riemannien  spaces,  or  G-spaces  (Buse- 
mann,  [7}).  In  this  lecture,  I  examine  the  foundations  of  metrics  with 
additive  segments ,  starting  with  an  ordering  of  pairs.  More  precisely, 
given  a  set  A,  and  an  ordering  _>  on  A  *  A  (or  a  mapping  (x,y)  -*•  xy 
of  A  x  A  onto  a  totally  ordered  set,  (P,^)),  what  axioms  guarantee  the 
existence  of  an  order-preserving  real-valued  function  4  on  P  such  that  for 
d(x,y)  ■  * (xy) ,  (A,d)  is  a  metric  with  additive  segments?  The  source 

of  this  problem  is  the  demand  fur  a  geometric  model  of  dissimilarities, 
di scussed  in  (2.2). 

The  key  to  analyzing  foundations  of  metrics  with  additive  segments  is 
the  ternary  relation  < xyz),  which,  in  terms  of  a  metric,  can  be  defined 
as  d(x,y)  ♦  d(y,z)  -  d(x,z).  We  must  define  this  relation,  and  establish 
its  main  properties,  using  the  ordering  alone.  Once  this  is  done,  we 
define  a  binary  operation  in  P  as  follows:  xy  ♦  x'y'  ■  uw  if  xy  ■  uv, 
x'y’  •  vw ,  and  <uvw^.  This  operation,  however,  cannot  necessarily  be 
defined  for  all  pairs  (xy.x’y')  (there  may  not  exist  additive  segments 
of  arbitrary  length).  Thus,  in  order  to  apply  Holder's  theorem  to  the 
system  (P,*,D,  we  must  establish  a  version  of  it  that  applies  when  the 
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binary  operation  is  defined  only  for  sufficiently  small  elements.  This 
sort  of  local  theorem  has  other  important  applications.  In  the  next 
section.  I  shall  state  it,  sketch  its  proof,  and  indicate  the  applications 
to  extensive  and  conjoint  measurement.  In  succeeding  sections,  I  shall 
return  to  the  question  of  metrics  with  additive  segments. 

3.2  A  bounded  version  of  Holder's  Theorem 

DEFINITION  3.1  Let  G  be  a  set,  with  binary  relations  _>,  B  on  G,  and  a 
binary  operation  (x,y)  -*■  x  ♦  y  from  3  to  G.  The  quadruple  (G.B,*,^) 
will  be  called  a  positive  ordered,  loodl  semigroup  if  the  following  are 
true  for  all  x,y,z,x',y'  e  G: 


1. 

>  is  a  total  order 

2 

if  (x.y)  c  B,  x  >_  x' 

f  y  1  y' ,  then  (y? ,x')  c 

B 

3. 

if  (x,y) ,  (x*y,z)  e  E 

,  then  (y, z) ,  (x,y*z)  e  B 

and 

(x  *  y)  ♦  i  •  x  *  (y  ♦ 

z) 

4. 

if  x  >_  y  and  (x.z) 

e  B,  then  x  +  z  y  +  z 

and 

z  ►  x  >  z  ♦  y 

5. 

if  (x,y)  c  B,  then 

x  ♦  y  >  x 

6. 

if  z  >  x,  then  there 

exists  y  c  G  with  (x,y) 

c  B  and 

z  >  x  ♦  y. 

A  positive  ordered  local  semigroup  is  Archimedean  if  for  all  x,y  c  G, 
{n|nx  defined,  y  _>  nx}  is  finite. 

Note  that  by  property  2,  (x.y)  l  B  iff  (y,x)  c  B;  from  this,  we 

know  that  y  ♦  z,  z  ♦  x,  i  ♦  y  *r«  defined  in  property  4. 
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THEOREM  3.1  Let  (G,B,+,>)  be  an  Archimedean  positive  ordered  local  semi¬ 
group.  Let  G'  ■  {x|3y,  (x,y)  t  B}.  Then  there  is  a  real-valued  function 

$  an  G'  such  that  for  all  x,y  e  G' 

(i)  x  ■>  y  iff  $(x)  _>  $(y) 

(ii)  if  (x,y)  e  B,  then  <#•  (x  +  y)  ■  ♦(x)  ♦  4> (y) . 

Moreover t  if  are  any  2  such  functions ,  then  *  asp  for  some 

a  >  0. 

The  proof  of  Theorem  3.1  is  like  that  of . Theorem  1.1  in  all  essen¬ 
tial  details.  We  note  only  2  slight  differences.  First,  for  x,y  e  G', 
y  >  x,  define  N(x,y)  to  be  the  largest  n  for  which  nx  is  defined  and 
y  >_  nx.  If  (x,y)  e  B,  then  y  >  nx  implies  (n  ♦  l)x  is  defined; 

hence,  for  (x.y)  e  B,  we  have  [N(x,y)  *■  l]x  >  y  >_N(x,y)x,  as  in 
Theorem  1.1. 

Second,  the  use  of  inverses  in  Theorem  1.1  is  solely  to  provide 
elements  of  form  y_1x,  where  y  <  x.  The  same  effect  is  achieved  here  by 
finding  y'  such  that  y  ♦  y'  <_  x,  using  property  6  of  Definition  3.1. 

Theorem  3.1  is  clearly  applicable  to  extensive  measurement,  for  the 
case  where  there  is  a  practical  upper  bound  on  the  size  of  elements  that 
can  be  compared.  This  has  been  discussed  by  Luce  and  Marley  [23].  Less 
obvious  is  the  application  of  the  theorem  to  a  more  realistic  version  of 
conjoint  measurement .  The  solvability  axiom  (Axiom  2  of  (1.4))  essentially 
forces  the  set  A/'  to  be  a  subgroup  of  real  nunbers,  whereas  in  practice, 
one  would  like  to  restrict  attention  to  a  bounded  subset  of  such  a  subgroup. 
This  corresponds  to  the  fact  that,  one  cannot  in  practice  always  solve 
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equations  of  the  form  a  -  (bj,b2)  for  b 2,  given  a,  bj.  The 
change  on  the  first  factor  may  be  so  large,  as  to  be  unmatchable  within 
Aj.  To  deal  with  this  case,  solvability  has  been  replaced  by  a  much 
more  realistic  assunption  (Debreu  [9];  Luce  [22])  cal  led  r*8triot*d 
solvability: 

2'.  For  all  a  e  A,  bj  c  Aj,  if  there  exist  Cj,  such  that 
Cb1,c2)  >  a  >  (bj.c^),  then  there  exists  c2  such  that 
a  >  (b,,c2);  and  a  similar  assumption  with  the  roles  of  Aj, 

A2  interchanged. 

In  this  more  restricted  situation,  one  can  order  "positive  differences" 
between  elements  of  Aj  by  comparison  with  a  "difference"  in  A,:  namely  define 
al  *  L  ai  “  b{  there  exist  a2>b2  e  A2  such  that 
(®1»^2^  i  (hj,a2)i  (b|,a2)  ^  (a|,b2). 

Certain  positive  differences  can  then  be  "added"  by  laying  off  equivalent 
differences  end-to-end.  An  additional  axiom,  similar  to  the  cancellation 
axiom  of  (1.4),  is  required,  and  minor  modifications  of  the  Archimedean 
axiom  (Axiom  4  of  (1.4))  are  needed,  but  given  these,  the  bounded  Holder's 
theorem  can  be  applied  to  the  system  of  positive  differences  on  each 
factor,  Aj  and  A2<  Ultimately,  this  leads  to  the  same  conclusion  as  that 
of  Theorem  1.3,  based  on  much  weaker  assumptions*. 


This  use  of  positive  differences  on  each  component  in  a  system  of 
additive  conjoint  measurement  is  unpublished;  it  draws  on  material 
from  a  book  in  preparation,  by  R.  D.  Luce,  P.  Suppes,  A.  Tversky,  and 
the  present  author.  For  a  slightly  different  treatment  based  on 
Axiom  2',  see  Luce  (22). 
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3.3  The  ternary  relation  <xyz) 

We  return  now  to  consideration  of  a  set  A,  and  a  mapping  (x,y)  -*•  xy 
of  A  *  A  onto  a  total  order  (P,>) .  If  there  exists  a  function  $  from  P 
to  the  reals,  which  preserves  order,  such  that  d(x,y)  =  ♦(xy)  is  a 
metric  with  additive  segments,  then  the  following  4  axioms  are  easily  seen 
to  be  necessary. 


1. 

For  x  f  y, 

xx  =  yy  <  xy. 

2. 

xy  -  yx. 

3. 

If  xy  <_  uw 

,  then  there  exists  v 

such  that  xy 

*  uv  and 

<uvw>. 

4. 

If  x  i  y. 

then  for  any  u,w  there 

exist  xQ,  .. 

. ,  xn  such 

that  xQ  - 

u,  x^  *  w,  and  for  i 

«  1,  ... ,  n, 

x.  .x.  <  xy 
t-1  t  -  7 

Axioms  1,  2,  and  4  are  stated  entirely  in  terms  of  the  ordering  >_,  but 
Axiom  3  involves  the  ternary  relation  (uvw}.  In  terms  of  the  desired 
metric,  this  means  d(u,v)  ♦  d(v,w)  »  d(u,w).  However,  we  shall  define 
<  >  in  terms  of  the  ordering  alone.  We  do  this  by  noticing  that,  if 

d(x,y)  ♦  d(y,i)  «*  d(x,z),  then  the  distance  d(y',i)  achieves  a  minimum 
at  y'  ■  y,  for  all  points  y'  on  or  inside  the  sphere  with  center  x  and 
radius  d(x,y).  This  characteriiation  of  y  uses  only  ordinal  relations. 

DEFINITION  3.2  ^xyz^  if  for  all  x',  y',  z'  such  that  x'y'  xy  and 
xi  both  of  the  following  hold: 

(i)  yi  1  y'i’ 

(it)  if  yz  •  y'z',  then  xy  ■  x'y'  and  xz  ■  x'z'. 

Define  <xyi)  if  both  <xyz)L  and  <zyx>{. 


i 

> 
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Roughly  speaking,  <xyz>L  holds  if  yz  is  xinixal  axong  all  y'z* 
such  that  y*  is  inside  a  sphere  of  radius  xy  and  x'  is  outside  a  concentric 
sphere  with  radius  xz.  The  relation  <xyz)  is  sinply  the  syxnetric  for* : 
clearly,  <xyz}  iff  <zyx^.  Henceforth,  in  Axiom  3  above,  the  relation 
<  >  will  be  understood  to  be  the  one  defined  by  Definition  3.2.  Fro* 

this  definition,  we  obtain  the  following  useful  lexxa  (only  Axiox  2  is 
used  in  the  proof). 

LBMA  3.1  If  <xyz),  x'y'  £  xy#  y'*'  <.yz,  and  xz  £x'z',  then 
^x'y’z'). 

This  follows  because  if  any  inequality  were  strict,  then  (ii)  of  the 
definition  would  yield  a  contradiction  of  one  of  the  other  inequalities. 
Hence  x'y'  »  xy,  y'z'  ■  yz,  and  xz  ■  x’z',  and  ^x,y,z'^  follows. 

We  also  note  that  from  Axioms  1  a  id  2  and  Definition  3.2,  if  <xyz^, 
then  xy.yz  <.  xz.  Given  these  preliminary  results,  we  can  prove  the. 
following  fundamental  theorem. 

THEOREM  3.2  If  Axioms  1-3  holdt  and  if  <xyz>  and  (xzwX  then 
<yzw>  and  <xyw}. 

Proof:  First  we  prove  (yzw^.  From  xz  <_  xw  and  (xyz^,  we  have 
yz  ^  yw.  By  Axiom  3,  choose  z’  such  that  yz  ■  yz'  and  ^yz'w^.  By 
Lemma  3.1,  it  suffices  to  show  zw  <_  z’w.  Since  <xyz^  and  yz'  <_  yz, 
we  have  xz'  <_  xz;  but  then,  by  (xzw>^,  *w  <_  z'w  follows  as  required. 
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Next  we  show  <xyw>.  Note  that  wy  <_  wx;  otherwise,  if  wx  <  wy, 
then  <wzx)L  implies  zx  <  zy,  contradicting  <zyx^.  By  Axion  3, 
choose  y'  with  wy  »  wy'  and  <wy'x^.  By  Leona  3.1,  it  will  suffice  to 
show  that  xy  <,  xy' . 

From  tym},  wz  ^wy'.  Construct  z'  with  wz  ■  wz'  and  <wz'y'X 
Suppose  xy'  <  xy.  Then  by  <xyz>,  either  xz'  <  xz,  or  yz  <  y'z'. 

The  former  is  false,  since  <xzw}  and  xz'  <  xz  imply  zw  <  z'w;  and 
the  latter  is  wrong,  because  <wz'y'^,  wz  ■  wz',  and  wy  ■  wy'  imply 
y'z'  <_  yz.  Hence,  we  conclude  that  xy  <.xy’,  as  required.  This  completes 
the  proof  of  Theorem  3.2 

Theorem  3.2  states  the  basic  property  of  the  ternary  relation  <  >, 

which  is  needed  to  construct  a  metric;  as  will  be  seen  in  the  next  seciton, 
it  corresponds  to  associativity  of  the  operation  ♦  defined  on  P. 

3.4  Existence  and  uniqueness  of  a  metric 

DEFINITION  3.3  Let  A,P,  be  as  above,  and  let  <  ^  be  given  by 
Definition  3.2.  Define  a  binary  operation  ♦  on  P  by  xy  *  x'y'  ■  uw 
if  xy  «  uv,  x'y'  *  vw,  and  <  uvw). 

We  note  that  this  is  well  defined.  Let  Pj  ■  P  -  {xx},  and  let 

B  *  {(xy,x'y') |xy  +  x'y'}  is  defined  .  Then  the  following  theorem  can  be 

proved. 
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THEOREM  3.3*  If  Axioms  1-3  of  (3.3)  hold ,  than  (Pj.B.-s^  in  a  position 
ordered  local  semigroup ;  if  Axiom  4  also  holds,  than  it  is  Arokimsdsan 
ae  well.  Hence,  if  Axioms  1-4  hold,  than  there  is  a  metric  A  on  A  such 
that 

(i)  xy^x'y’  iff  d(x,y)  >.  d(x‘ ,y‘) 

(ii)  <xyz>  iff  d(x,y)  ♦  d(y,z)  ■  d(x,z). 

Moreover,  d  is  unique  up  to  similarity  transformations  (i.e.,  A  is  a  ratio 
scale) . 

Proof:  The  proof  that  (Pj,B,+,>)  is  a  positive  ordered  local  semigroup 
if  Axioms  1-4  hold  is  almost  immediate.  To  illustrate*  we  prove  associa¬ 
tivity  (in  the  sense  of  property  3,  Definition  3.1).  Suppose  (xy^'y1) 
and  (xy  ♦  x,y,,x,'y,,)  e  B.  Let  uv  ■  xy  ♦  x’y’  and  vw  ■  xMy",  with 
<uvw>.  Let  u'v*  =  xy,  v'w'  ■  x’y*,  with  ^u’v'w'^.  Then  u'w'  ■  uv. 
Since  u'v'  <_  uv,  there  exists  z  with  u'v*  ■  uz  and  <uzv}.  By 
Theorem  3.2,  <zvw>  and  <uzw}.  By  Definition  3.2,  zv  »  x'y'.  Thus, 
x’y'  +  x^y"  is  defined  and  *  zw;  so  xy  ♦  (x'y'  ♦  xMy")  is  defined 
and  *  uw  »  (xy  +  x'y')  ♦  x"y". 

From  Theorem  3.1,  there  exists  a  real-valued  function  ♦  on  Pj  such  that 
xy  >_  x'y'  iff  $(xy)  >_  (Hx'y1)  and  4(xy  ♦  x’y')  »  ♦(xy)  ♦  ♦(x,yt).  We 
note  that  there  is  at  most  one  p  e  Pj  -  pj,  i.e.,  a  maximal  element  of 
P,  if  such  exists.  We  define  d  on  A  *  A  by 
d(x,y)  *  f  4>(xy)  if  xy  e  Pj, 

<0  if  x  -  y, 

^  sup{<|>(uv)  |  uv  e  Pj}  if  xy  is  maximal. 

♦ 

The  results  in  f 3. 3) - (3. S)  are  essentially  due  to  Beals  and  Krantz  [2]. 
They  considered  a  somewhat  more  general  situation.  The  version  presented 
here,  particularly  Theorem  3.3,  draws  on  unpublished  material  from  a  book 
in  preparation  by  Luce,  Suppes ,  Tversky,  and  Krantz. 
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Obviously,  xy  >  x'y'  iff  d(x,y)  >  d(x’ ,y!)  and  < xyz>  iff 
d(x.  1  *  d(y,z)  =  d(x,z).  The  triangle  inequality  follows  from  the 
definition  of  <  ).  Also,  by  Axiom  4,  if  xy  is  maximal  in  P,  then 
sup{ifc(uv)|uv  e  Pp  is  finite.  Thus,  d  is  a  metric  satisfying  (i)  and 
(ii).  Clearly,  any  other  metric  d!  with  the  same  properties  defines  a 
function  on  P‘  with  the  same  properties  as  $;  $’  =  a$,  hence, 

d'  =  ad;  follow  from  the  uniqueness  assertion  of  Theorem  3.1.  This 
completes  the  proof  of  Theorem  3,3. 

3,5  Existence  of  segments 

Note  that  Axioms  1-4  of  (3.3)  can  be  satisfied  by  the  set  A  -  {x,y,z}. 
with  xy  *  yz  <  xz.  In  fact,  ^xyz)  holds,  and  the  only  d  satisfying 
Theorem  3.3  is  given  by  ci(x,y)  =  d(y,z)  *  D,  d(x,2)  =  2D,  where  D  >  0 
is  arbitrary.  Such  finite  examples  are  avoided  if  we  impose  the  requirement 
that  any  x,z  e  A  be  joined  by  an  additive  segment,  as  defined  in  (3,1). 

One  way  to  guarantee  this  is  to  impose  2  additional  conditions,  nondis- 
creteness  and  completeness: 

5.  P  -  {xx}  has  no  minimal  element. 

6.  If  x,  is  a  sequence  in  A  such  that  for  u  i  v,  x^x^  £  uv  for 
all  but  finitely  many  (i,j),  then  there  exists  y  e  A  such  that 
for  u  ^  v,  x.y  £  uv  for  all  but  finitely  many  1.  That  is,  any 
Cauchy  sequence  converges . 

We  call  a  subset  Y  of  A  a  partial  segment  from  x  tc  z  if  (i)  x,z  e  Y 


and  (ii)  for  any  u,v  e  Y,  <xuv^  or  (xvu).  The  set  y  is  a  segment  from 
x  to  z  if  it  is  a  maximal  nartial  segment  from  x  to  z.  By  Zorn's  lemma. 
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for  any  x,z,  there  exists  a  segment  y  from  x  to  z.  We  wish  to  show  that 
y  is  isometric  to  [0,d(x,z)j.  For  y  e  y,  let  f(y)  *  d(x,y).  Obviously, 
d(u,v)  3  j  f (u)  -  f(y)|,  so  f  is  an  isometry  of  y  into  [0,d(x,z)j,  It 
remains  only  to  show  that  f  is  onto.  For  this,  we  use  Axioms  5  and  6 
above . 

Let  t  be  e  (0,d(x,z)).  Let  y.  =  {y  e  y|f(y)  <_  t},  y2  »  (y  e  yjf(y)  >_tL 
From  Axiom  6  it  is  easily  shown  that  f  attains  its  maximum  in  y^  at  some 
y1  e  Yj  and  its  minimum  in  y  at  y2  e  y2>  For  example,  if  is  a 
sequence  in  y^  such  that  d(x^)  -*•  sup{f(y)|y  c  y^,  then  by  Axiom  6, 

•+  y^  e  A,  and  it  is  easy  to  see  that  y  U  {y^}  is  a  partial  segment. 

It  follows  by  maximality  that  y^  e  y;  y2  is  treated  similarly.  If 
•yl  =  y2’  then  *^y’)  3  *(y2)  *  recluiret*  preimage  of  t  in  y 

has  been  constructed.  But  for  yj  i  y2,  we  can  use  Axioms  5  and  3  to  choose 
y  with  <y1yy2>,  and  y  V \>Y 2'  8y  construction,  y  {  y,  but  by 
Theorem  3.2,  y  U  {y}  is  a  partial  segment,  contradicting  the  maximality 
of  y.  Thus,  Vj  =  y2  as  required.  This  completes  the  proof  of  the 
following  theorem  (whose  converse  is  obviously  true  also): 

THEOREM  3.4  Let  A,P ,■>  satis fy  Axiave  1-6.  Then  there  ie  a  metric  d  on 
A,  unique  up  to  multiplication  by  a  positive  constant,  such  that  (A,d)  ie 
a  complete  metric  space  with  additive  segments,  and  such  that  xy  >_  x'y' 
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4.  Polynomial  measurement  theories 


4.1  Independence  and  sign -dependence 

n 

If  is  a  binary  relation  on  n  A.  ,  where  n  ^  2,  it  induces  other 

i  =  l  1 

binary  relations  on  products  of  any  m  factors,  where  m  <  n,  For  example, 

n  m 

if  b  is  a  fixed  element  of  H  A.  ,  and  a, a'  are  elements  of  II  A.  , 

im+1  1  i=l  1 

we  can  define 

a  >^(b)  a'  if  (a,b)  >_  (a',b). 

n 

Thus  any  choice  of  a  fixed  be  n  A.  induces  a  relation  >_(b)  on 
m  i=m+l 

n  A..  Similarly,  choosing  fixed  components  in  any  subset  of  factors 
i=l  1 

induces  a  binary  relation  over  the  product  of  the  remaining  factors. 

One  of  the  things  that  makes  the  study  of  polynomial  combination  laws 

fruitful  is  that,  for  binary  relations  obeying  such  laws,  the  induced 

relation  ^(b)  varies  in  regular  and  interesting  ways  as  a  function  of 

the  vector  of  fixed  components,  b.  Recall  that,  according  to  Definition 

2.1,  (Aj ,  ...,  An>>)  satisfies  a  polynomial  combination  law  f  provided 

n 

that  there  are  real-valued  functions  on  A.  and  <p  on  n  A. ,  such 

i»l 

that  <J>  is  order-preserving  and  <j>  *  f($j,  4>n) .  A  simple  example 

of  regularity  of  induced  relations  occurs  if 

f(x  ,  ....  x  )  *  g(x.,  ...,  x  )  ♦  h(x  . ,  . ..,  x  ).  In  that  case,  it  is 
i  n  i  m  1  n 

n 

obvious  that  ^(b)  is  independent  of  b,  for  ben  A.;  the  ordering 
m  i*m+l 

of  elements  of  n  A^  depends  only  on  the  values  of  gO^,  ....  4^) . 

i=l  m  n  m 

We  say  in  this  case  that  II  A.  is  independent  of  11  A. .  In  the 

i“l  n  i-m+1 

simplest  case,  f(x^,  ....  xn)  =  £  x.,  and  any  subset  of  factors  is 

"  i  =  l 

independent  of  its  complement.  We  say  in  this  case  that  the  system 
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(Aj ,  ....  An,>)  is  completely  independent;  complete  independence  is  thus 
a  necessary  condition  for  additive  conjoint  measurement  in  n  factors. 

A  subset  of  factors  can  be  independent  of  a  proper  subset  of  its 
complement,  even  when  it  is  not  independent  of  the  entire  complement; 
this  occurs  when  an  induced  relation,  >_(b,c),  does  not  depend  on  the 
vector  of  components  represented  by  b,  for  fixed  c.  For  example,  when 
f'Xj^.X-j)  *  (Xj  ♦  x2^x3»  Ai  is  independent  of  A2,  and  vice  versa£ 
although  Aj  need  not  be  independent  of  A^  *  A^. 

If  f(xr  ....  xn)  =  gfXj,  xm)  •  h(xm+1,  ...,  xn),  then  the 

m 

induced  ordering  of  elements  in  n  A^  depends  not  only  on  g(tj,  ...»  $>m) 

i= 1 

but  on  whether  h($m.,  . ..,  4>n)  is  positive,  negative,  or  zero.  Thus, 

n  A.  can  be  partitioned  into  at  most  3  subsets,  S+,  S^,  S" .  All 
i=m*l  1  + 

induced  orders  >_(b)  are  identical  for  b  t  S  ,  and  are  the  reverse  of 

>_(b)  for  b  e  S,  while  _>(b)  is  degenerate  (the  universal  relation) 

0  “ 

for  b  e  S  .  We  express  this  partition  property  by  saying  that  n  A. 

n  i«l  1 

is  eign- dependent  on  n  A. .  Independence  of  is  the  special  case  of 

i=m+l  1  + 

eign- independence  on  in  which  only  S  or  S  is  nonempty.  Thus,  for 


(Xj  +  x2^x3»  Ai  x  A2  A3  are  mutually  sign-dependent.  It  is  not 

true,  however,  tha*  Aj  *  A^  is  sign-dependent  on  A^.  In  fact,  for 


(X1  ♦  y2)x. 


(xj  *  y2)x* 


X1X3  '  X1X3 

- — ; - 

X3  '  X3 


so  that  the  value  of  y2  *  $2(a2)  at  w*l*c*'  t^'e  order  of  (Xj,x,)  and 
(xj.xp  reverses  is  not  fixed,  leading  to  a  partition,  but  varies  with 

*r  3#  r  3‘ 
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Independence  and  sign-dependence  properties  are  good  examples  of 
qualitative  laws.  Independence  is  subject  to  straightforward  experiisental 
testing,  wherever  order  relations  can  be  determined;  when  satisfied,  it 
strongly  indicates  that  the  effects  of  certain  variables  can  be  evaluated 
apart  from  consideration  of  the  fixed  values  of  other  variables.  Sign- 
dependence  is  similarly  testable;  moreover,  it  has  a  kind  of  special 
"flavor",  since  a  large  effect  can  be  produced  in  2  completely  different 
ways,  by  combination  of  two  "positive"  values  or  of  two  "negative"  values. 
An  instance  of  sign-dependence  was  discussed  in  section  (2.4)  where 
moral  evaluations  of  descriptive  adverb -adjective  combinations  were 
studied,  e.g.,  "slightly  evil",  etc.  There,  the  adverb  factor  is  sign- 
dependent  on  the  adjective  factor;  "evil"  is  in  the  S"part  of  A2-  It 
scarcely  needs  experimental  testing  to  show  that  sign-dependence  is  more 
appropriate  than  independence,  and  hence,  that  multiplication  is  more 
appropriate  than  addition.  The  detailed  accuracy  of  a  multiplicative  model 
is,  of  course,  another  question. 

In  general,  particular  polynomials  exhibit  more  or  lers  idiosyncratic 
patterns  of  independence  and  sign-dependence;  thus,  it  is  possible,  on 
the  basis  of  empirical  information  concerning  these  properties,  to 
diagnose  appropriate  polynomial  combination  rules,  or  at  least,  to 
narrow  the  field. 

4.2  Additive  conjoint  measurement  and  independent  dimensions  in 
geometry 

It  was  indicated  previously  that  a  system  of  additive  conjoint  measure¬ 
ment  is  completely  independent.  Surprisingly,  a  partial  converse  can  be 
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proved:  if  (Aj,  ...»  , >_)  satisfies  complete  independence,  restric¬ 

ted  solvability,  and  an  Archimedean  condition,  and  if  n  3,  then 

n 

(Aj ,  ....  An>>)  is  decomposable  relative  to  £  .  (Recall  that 

restricted  solvability  means  that  any  shift  on  n  -  1  factors  can 
be  compensated  by  a  shift  on  the  nth  factor,  given  that  certain  boundedness 
conditions  hold.)  Since  restricted  solvability  and  Archimedean  conditions 
are  usually  assumed  to  be  valid  empirically,  this  means  that  if  n  3 
(in  the  sense  that  there  are  at  least  3  nontrivial  factors),  complete 
independence  is  the  empirical  equivalent  to  additivity  of  the  factors,* 

(No  such  result  holds  for  n  =  2;  in  that  case,  the  cancellation  axiom 
(Axiom  3  of  (1.4))  is  required.)  The  proof  relies  on  the  bounded  Holder 
theorem  established  in  (3.1)  but  it  is  quite  complicated  and  I  shall  not 
present  it  here.  It  can  be  greatly  simplified  if  unrestricted  solvability 
is  assumed. 

It  was  recognized  by  Tversky  [36]  that  additive  conjoint  measurement 

can  be  profitably  applied  to  metric  representation  of  similarity  orderings. 

Suppose  that  we  have  a  dissimilarity  ordering  >.  on  A  x  A,  as  in  section 

n 

3,  but  that  the  set  A  is  endowed  with  product  structure,  A  ■  IT  A.. 

i*l  1 

If  the  ordering  on  A  *  A  is  to  be  represented  by  a  Euclidean  metric  d, 
with  the  sets  A^  as  a  complete  set  of  orthogonal  coordinates,  then  there 


are  functions  <f>.  on  A.  such  that 
i  l 


d(a,b) 


’  n  2 

I  (a.)  -  Mb.)r 


1/2 


A  topological  version  of  this  theorem  was  published  by  Debreu  [9  ]. 
The  present  version  is  due  to  R.  D.  Luce,  and  is  taken  from  material 
for  a  book  in  preparation  by  R.  D.  Luce,  P.  Suppes,  A.  Tversky,  and 
the  present  author. 
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« 

This  equation  can  be  generalized  in  2  ways: 

d(a.b)  *  F  [  WV]  (1) 

dfa.b)  -  F  £  |4»1(a1)  -  ^(b^l,  |*n(an)  -  ^(b^l  j  .  (2) 

In  (1),  F  is  a  strictly  increasing  function  of  1  variable,  while  in  (2), 

F  is  strictly  increasing  in  each  of  the  n  variables.  Equation  (1) 
specifies  that  dimensions  combine  additively  to  determine  dissimilarity, 
while  (2)  specifies  that  the  contribution  of  any  one  dimension  can  be 
represented  by  absolute  differences  of  scale  values. 

If  d  is  not  required  to  be  a  metric,  then  Equation  (1)  is  simply  the 
equation  of  additive  conjoint  measurement,  in  n  variables.  Similarly, 
when  d  need  not  be  a  metric,  then  the  absolute  difference  representation, 
on  any  one  dimension,  can  be  analyzed  by  methods  very  close  to  those  of 
additive  conjoint  measurement  in  2  variables.  (Absolute  difference  repre¬ 
sentations  have  been  extensively  studied  by  Pfanzagl  [28].) 

If  we  combine  the  conditions  for  n-factor  additivity  (across  dimensions), 
for  2-factor  additivity  (within  each  dimension),  and  for  a  metric  with 
additive  segments  (section  3  above),  then,  as  was  shown  by  Tversky,  the 
metric  d  is  constrained  to  the  form, 

d(a.b}  -  F'1  [  F(|  +  .(a.)  -  ♦  ,(b,)|)  ]  (3) 

L  i«l  11  J 

where  F  is  an  increasing  function,  satisfying  F(a  ♦  8)  _>  F(a)  ♦  F(6). 


If  F(o)  *  a  ,  1  <_  i  «  •,  then  d  is  the  Minkowski  r-metric.  I  do 

not  know  what  other  functions  F,  if  any,  yield  a  metric  with  additive 

segments  satisfying  Equation  (3);  but  if  the  additive  segments  are 

required  to  lie  on  algebraic  straight  lines  (in  the  coordinates 

<t>j(aj),  ...»  $n(an))»  t^e  power  is  the  only  solution.  These 

considerations  lead  to  an  axiomatization  of  the  Minkowski  r-metrics 

n 

in  terms  of  the  primitives  A  *  n  A.  and  >_  on  A  *  A.  The  Euclidean 

i  =  l  1 

case,  r  *  2,  can  be  distinguished  by  numerous  properties,  for  example, 
the  fact  that  all  rotations  are  isometries. 

If  we  let  F(o)  =  era  -  1,  r  >  0,  then  (3)  yields  a  metric,  but 
only  points  differing  in  exactly  one  dimension  can  be  joined  by  an  addi¬ 
tive  segment.  The  geometry  of  this  ’’exponential  metric"  seems  not  to 
have  been  studied. 

4.3  Simple  polynomials 

We  now  return  to  consideration  of  polynomial  combination  rules  more 
general  than  additivity.  The  independence  or  sign-dependence  properties 
that  are  logically  necessary  for  a  given  polynomial  are  usually  not 
sufficient,  even  if  appropriate  solvability  and  Archimedean  conditions  are 
assimed.  In  this  respect,  the  pure  additive  and  pure  multiplicative 
rules  are  exceptional.  For  one  class  of  polynomial  combination  rules, 
called  simple  polynomials,  there  is  a  general  schema  for  finding  a 
sufficient  set  of  axioms.  This  class  is  defined  as  follows:  (i)  single 
variables  are  simple  polynomials;  (ii)  if  fj  and  f2  are  simple  polynomials 
with  disjoint  variables,  then  *  f,  and  f  f  are  simple  poly- 
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nomials;  (iii)  no  polynomials  are  simple  except  by  virtue  of  (i)  and 
(ii).  More  formally: 

DEFINITION  4.1  Let  F ( Y )  be  a  ring  of  polynomials  in  the  indeterminates 
Y.  Then  S(Y]  is  the  smallest  subset  of  F[Y]  such  that 
(i)  YcS[Y] 

(ii)  if  Yj.Y^  c  Y,  with  Y^  fi  Y2  empty,  then  for  any  f^  e  3 [ Y ^ ] 
and  f^  c  SfY^],  fj  +  f,  and  f  f ,  t  S[Y]. 

The  elements  of  S [ Y ]  are  the  sin, pie  polyyicmiale  of  F[Y]. 

To  axiomatize  polynomial  conjoint  measurement,  relative  to  a  simple 
polynomial  f,  for  (A^ ,  ,  >_) ,  we  proceed  to  introduce  a  series  of 

addition  and  multiplication  operations  in  the  set  A/'.  An  addition  opera* 
tion  is  introduced  for  each  decomposition  of  a  simple  component  of  f  as 
the  sun  of  smaller  simple  components  with  nonoverlapping  variables,  and 
similarly  for  multiplications.  The  manner  of  introducing  these  operations 

is  the  same  as  in  section  (1.4),  e.g.,  (b,,a^)  ♦  (a°.b.)  *  (b  ,b_), 

.  0  .00.  ,  .  .  . 

where  a  *  (aj.Hj)  is  the  origin  (or  unit,  for  multiplication).  How¬ 
ever,  one  must  be  careful  to  keep  the  origin  the  same  for  all  additions, 
the  unit  the  same  for  all  multiplications,  and  to  choose  the  origin  as 
a  multiplicative  zero.  In  addition  to  suitable  sign-dependence,  solva¬ 
bility  and  Archimedean  conditions,  three  classes  of  axioms  need  to  be 
introduced: 

(i)  appropriate  cancellation  conditions,  like  Axiom  3  of  (1.4),  that 
guarantee  that  the  operations  introduced  are  \  el  1 -defined,  commutative, 
and  associative; 
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(ii)  conditions  guaranteeing  that  all  the  addition  operations  have 
the  s tmt  effect,  as  do  all  the  multiplications; 

(iii)  a  condition  that  is  used  to  prove  distributivity  of  multipli¬ 
cation  over  addition. 

Of  course,  all  these  axioms  may  not  be  logically  independent,  so  that  some 
of  them  may  be  eliminable  in  any  given  instance. 

Rather  than  give  complete  abstract  details  of  this  general  schema, 

I  shall  sketch  an  illustration  for  4-factor  conjoint  measurement,  relative 
to  the  polynomial  x^x,  ♦  x^x^. 

We  choose  a  suitable  origin,  a^  =  (a^ , a^ ,a^ ,a^)  and  a  suitable  unit, 
a  *  (e1,a2,a3,a4)  -  (a, .a^.a^.a^ .  Addition  is  defined  by 
(brb2,a3,aJ)  ♦  Uj  ,a2,b3,b4)  *  (bj  »b2 ,b 3,b4 ) . 

Two  different  multiplications  are  defined  by, 

,.  1  0  0,  ,  1  .  0  0,  .  0  0, 

^  ,  &  2  i  ®  j  i  x  8^  ) 

,00.  1 ,  ,  0,  0  1.,  .00... 

^al’a2,b3,a4  ‘  ^al  a2 ’ ’  4  *  lal ,a2 ,b3,b4^ • 

These  definitions  are  chosen  so  that  any  element  b  of  A/-  satisfies 

.  .  .  ,  ..  1  0  0  ,  1  ,  0  0, 

(b j  «b2 ,b3,b4)  *  (bj  ra,  ,a3  ,a4 j  •  Uj ,b, ,a3,a4) 

,  o  0  .  1,  .001., 

+  (Wb3'V  '  U1 'a2’*3’b4)  • 

Once  we  have  constructed  a  ring  isomorphism  $  if  ( A/  ,_>)  into 

1  0  0 

(Re ,♦,♦,>) ,  we  can  define  s,(b.)  *  $ (b , , a. , a. , a, )  ,  etc.,  and  obtain, 

—  11  1  -  3  4 

from  the  previous  equation  and  the  isomorphism  property,  the  desired 


4(b)  *  4i(b,)*2(b1)  ♦  *3lb,) »,(b4) 


relation 
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0  i 

Unlike  the  simple  additive  case,  the  origin  a  and  ur.it  a'  must 
be  chosen  with  some  care,  and  the  solvability  and  cancellation  conditions 
must  be  formulated  to  tike  note  of  exceptions.  !n  particular ,  we  start 
by  assming  that  A^  *  A,  and  *  A^  are  mutually  independent,  while 
and  A 2,  as  well  as  and  A^,  are  mutually  sign-dependent.  For 
i  ■  1,  2,  3,  4,  a?  must  be  chosen  in  the  0- cl  ass  oetermin-sd  by  «igj.- 
dependence,  while  aj  must  be  chosen  outside  the  0-class.  This  anoints 
to  choosing  the  additive  origin  to  be  the  multiplicative  zero,  and  the 
multiplication  to  be  nontriviai.  The  solvability  .ud  cancellation  condi¬ 
tions  must  be  formulated  with  due  care  for  division  by  zero.  Lastly,  some 
delicacy  is  required  in  designating  ♦  and  -  signs,  restive  to  sign-depen¬ 
dence.  Once  all  this  is  done,  and  suitab’o  conditions  of  type  rii)  and 
(iii)  are  imposed,  guaranteeing  that  the  multiplications  coincide  aid 
are  distributive  over  addition,  we  obtain  an  ordered  ring,  as  required. 

To  illustrate  the  derivation  of  type  {iil  snd  (iii)  axioms,  I  indi¬ 
cate  one  *hit  guarantees  distribut >  vitv  of  nultif'-icstion  over  addition. 
Represent  3  arbitrary  equivalence  classes  by 

b  -  (blfa2,a3,a4) 

,  0  0  .  1, 
c  -  (alfa2,b3,a4) 

1  0  0  0  0  1 
d-  (*1  'bn* *3**4)  '  (*1'*2'*3  V’ 

Then  b  •  d  >  c  •  d  *  (b}  ,b2 ,i>3,b4) 
b  ♦  c  *  (b j ,aj,b3<a4) . 

Choose  c,  such  that  (b, .ai.b-.a!1  -  (c, ,a‘  s^.a^l .  Then 
l  I/34  i  ‘  3  4 

fb  ♦  c)  •  d  *  j  ,b,,a3,a4) . 
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Thus,  the  required  axiom  is  one  that  permits  us  to  infer,  from  the 
equivalences 

,  1  .  oo.  ,001, 

Uj ,b2,a3,a4)  -  (a1,a2,a3,b4j 
b i ’ ^2 ’ ^ 3 ’ ^4  (ci *a2,a3’a4^ 

the  conclusion 

(b1,b2,b3,b4J  -  !c1,b2,a3,a4) 

i.e.,  b  •  d  ♦  c  •  d  *  (b  ♦  c)  *  d.  With  some  extra  trouble,  the  required 
axiom  can  be  formulated  as  a  general  condition,  independent  of  the  choice 
of  the  ,  a.,  and  logically  necessary  for  the  desired  polynomial  combina¬ 
tion  in  ie. 

Finally,  note  that  all  the-  axioms  ir. reduced,  except  for  solvability, 
turn  out  to  be  logically  necessary  conditions  for  the  given  polynomial 
combination  rule.  It  wouM  be  interesting  to  reformulate  the  above  treat¬ 
ment  of  polynomial  measurement  with  restricted  solvability,  obtaining 
only  a  subset  of  an  ordered  ring,  but  this  remain*  an  open  problem,  which 
may  involve  considerable  technical  difficulty. 
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5,  The  measurement  of  color 

5,1  Metameric  matcher,  and  vectorial  representation 

The  psychological  laws  on  which  color  measurement  is  based  were 
clearly  enunciated  by  H.  Grassman  in  1853  [12],  and  bear  his  name. 

These  laws,  and  the  measurement  techniques  based  on  them,  are  of  par¬ 
ticular  interest,  because  of  their  simplicity  and  beauty,  and  because 
they  involve  an  unusual  blend  of  physics,  physiology,  and  psychology. 

Hie  empirical  basis  of  color  measurement  involves  a  physically  defined 
binary  operation,  additive  color  mixture ,  .and  a  psychological  equivalence 
relation,  metaneri&m,  Grassman' s  laws,  which  relate  these  primitives, 
have  clearcut  physiological  implications  (Brindley  [5  ],  pp.  198-218). 

Thus,  color  measurement  has  long  been  the  point  of  departure  for  physio¬ 
logical  and  psychological  color  theories. 

The  measurement  representation  for  colors  involves  vtctors  over  the 
real  nutibers,  rather  than  real  nunbers  alone.  International  standards 
for  the  vectorial  representation  of  colors  were  adopted  in  1931  by  the 
International  Commission  on  Illumination  (ICI)  [15].  Extensive  discussions 
of  these  color  measurement  standards  may  be  found  in  the  works  of 
Wright  [37]  and  Stiles  [32]. 

In  this  first  section,  we  assume  that  the  stimulus  whose  color  is 
to  be  measured  consists  of  a  small,  homogeneous  patch  of  light,  viewed 
under  standardized  conditions.  Such  a  stimulus  is  specified  by  a 
function  giving  its  energy"',  or  energy  density,  for  each  wavelength  in 

*  The  term  "energy"  is  used  in  a  broad  sense;  depending  on  the  nature  of 

the  stimulus,  various  measures  derived  from  energy  may  be  more  appropriate, 
e.g.,  power,  power  per  unit  area  of  source,  etc.  For  any  change  of  units 
the  nonnegative  measures  that  correspond  to  olor  stimuli  need  only  be 
altered  by  a  suitable  constant  factor. 
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the  visible  portion  of  the  electromagnetic  spectrin  {wavelengths  from 
-7  -7 

4  *  10  to  7  x  10  meters,  approximately).  Thus,  a  color  stimulus 
may  be  considered  to  correspond  to  a  nonnegative  (energy)  measure  defined 
on  the  Sorel  subsets  of  a  real  interval. 

Additive  color  mixture  means  summation  of  the  energy  in  the  mixed 
stimuli.  For  example,  if  stimuli  b  and  c  are  produced  by  illimdnating 
the  same  portion  of  a  screen  with  light  from  two  different  projectors, 
then  the  mixture,  b  ♦  c,  is  produced  by  turning  both  projectors  on  at 
once.  In  fact,  the  court  ably  additive  real-valued  set  functions  on  the 
Borel  subsets  of  the  visible  spectrum  form  a  vector  space  over  the  reals, 
which  we  denote  B;  the  nonnegative  elements  of  B,  which  correspond  exactly 
to  the  possible  specifications  of  color  stimuli,  form  a  convex  cone  in 

B,  denoted  C.  Additive  color  mixture  corresponds  to  vector  addition  in 

C. 

Two  distinct  elements  of  C  may  correspond  to  color  stimuli  that  look 
alike  in  color.  We  say  that  such  stimuli  are  a  metameAo  match,  or. more 
simply,  are  metaners.  We  denote  the  relation  of  metamerism  by  With 
suitable  experimental  methods,  for  a  normal  observer,  -  can  be  considered 
to  be  an  equivalence  relation  on  C,  to  a  very  high  degree  of  approximation. 
There  are  many  examples  of  metameric  matches:  for  instance,  a  stimulus 
with  a  "bimodal"  energy  distribution,  with  most  of  the  energy  in  the 
"red”  and  "green"  parts  of  the  visible  spectrum,  is  metaaeric  to  an 
appropriate  "unimodal"  stimulus  with  most  of  its  energy  concentrated  in 
the  "yellow"  part  of  the  spectrum. 
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Let  M  be  the  set  of  differences  of  raetameric  pairs,  i,e.t 
M  ■  {d|d  c  B,  and  for  some  b,c  e  C,  with  b  ~  c,  d  *  b  -  c}. 

The  content  of  Grossman's  third  law  is  that  M  is  a  linear  subspace  of  B. 
This  experimental  finding  has  been  confirmed  in  modem  studies  to  a  high 
degree  of  approximation,  over  a  wide  range  of  conditions  (see  Brindley, 

[ 5  J ,  p,  211).  We  can  now  state  succinctly  the  classical  experimental 
law  of  color  mixture,  the  Law  of  Trichromacy  ( Grossman's  first  law): 

For  a  normal  observer,  dim  B/M  *  3. 

An  observer  is  called  dichromatic  if  dim  B/M  =*  2,  and  monochromatic 
(totally  color  blind)  if  dim  B/M  =  1. 

Standardized  systems  of  color  measurement  employ  a  convenient  basis 
for  B/M.  For  b  e  C,  the  coordinates  of  b  +  M  relative  to  the  basis 
in  B/M  are  called  tristimulus  coordinates  of  b.  Thus,  two  stimuli  have 
the  same  tristimulus  coordinates  if  and  only  if  they  are  metameric. 

The  ICI  standards  specify  tristimulus  coordinates  for  an  average  standard 
observer ,  for  approximately  monochromatic  stimuli  (point  measures).  The 
tristimulus  coordinates  for  more  general  stimuli  are  computed  by  approxi¬ 
matin''  these  as  sums  of  monochromatic  stimuli. 

Another  useful  set  of  coordinates  is  obtained  by  regarding  the  tri¬ 
stimulus  coordinates  for  b  as  homogeneous  (projective)  coordinates  for 
the  one-dimensional  subspace  generated  by  b  +  M.  If  these  homogeneous 
coordinates  are  normalized  so  they  sum  to  1,  they  are  called  ohrmatioity 
coordinates .  Two  stimuli  have  the  same  chromaticity  coordinates  if  a 
scalar  multiple  (change  in  overall  energy  level)  of  one  of  them  is  meta¬ 


meric  to  the  other. 
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5.2  Photopigments 

The  biological  effects  of  light  are  mediated  through  absorption 
by  photopigments.  The  absorbing  properties  of  a  photopigment  are 
specified  by  a  spectral  absorptance  fuiction  p.  For  wavelength  X, 
p(X)  is  the  fraction  of  the  incident  energy  at  wavelength  X  which  is 
absorbed  by  the  pigments  and  converted  into  electrochemical  energy. 

For  a  photopigment  in  human  visual  receptors,  it  is  convenient  to 
include  in  p(X)  the  wavelength-dependent  alterations  in  the  stimulus 
between  the  point  where  it  is  specified  by  a  measure  and  the  point 
where  it  is  absorbed  by  a  receptor,  eg.,  reflection  at  the  cornea, 
absorption  and  scattering  in  the  ocular  media.  If  this  is  done,  then 
the  average  number  of  quanta  absorbed  and  converted  into  electrochemi¬ 
cal  energy,  from  a  stimulus  b,  by  a  photopigment  p,  is  proportional  to 

J  X  p(X)  db(A). 

The  integral  is  t  alien  over  the  visible  spectrum;  the  factor  X  is  intro¬ 
duced  because  average  quanta/unit  energy  is  proportional  to  wavelength. 
Thus,  a  photopigment  p  defines  .1  linear  functional  on  C,  and  by  the  natural 
extension,  on  B  as  well.  (Departures  from  linearity  occur  insofar  as  the 
photopigment  is  appreciably  depleted  by  the  ensuing  photochemical  reac¬ 
tion)  . 

Grassman's  laws  would  he  explained  if  we  assume  that  there  are  3 
linearly  independent  photopigments,  p } ,  p,,  p^,  which  mediate  color 
vision.  The  intersection  of  the  null-  '.us  of  the  p.  would  be  exactly 
M.  The  p.  define  linear  fiuutionals  |.  on  B/M,  which  span  the  dual  space 
of  B/M .  Their  dual  basis  yield-  a  ju  t.  rred  coordinate  system  in  B/M. 
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Relative  to  this  basis,  the  i**1  tristimulus  coordinate  of  b  is  pre¬ 
cisely  Pj(b)  *  /  Ap^(A)db(A),  the  effective  quantal  absorption  of 
stimulus  b  by  p^.  One  may  hope  to  account  for  various  other  properties 
of  color  (for  example,  color  discriminability  or  perceived  hue)  in  a 
simple  way  in  terms  of  those  coordinates,  which  represent  the  basic  physio¬ 
logical  responses  (the  "Grundempfindungen"  of  Helmholtz) .  This  is  the 
thesis  of  the  Ycung-Helraholtz  theory  of  color  vision. 

An  important  variant  of  the  3-pigment  hypothesis  postulates  n  linearly 
independent  photopigments,  p^,  ....  p^,  n  3,  which  in  turn  contri¬ 
bute  linearly  to  3  independent  outputs  q^,  q2,  q^,  where 

n 


l  a..p.,  1  -  t.  2.  3. 


The  q^  are  again  linear  functionals  on  B,  whose  null-spaces  intersect 
in  M,  and  which  induce  a  preferred  coordinate  system  in  B/M,  This 
variant  allows  a  stage  of  "recoding",  represented  by  the  linear  trans¬ 
formation  (a„),  between  photopigment  absorption  and  the  basic  outputs 
that  determine  other  aspects  of  color  vision,  such  as  discriminability, 
subjective  appearance,  etc.  This  sort  of  recoding  is  a  feature  of  many 
color  theories,  particularly,  the  Hering  opponent -colors  theory  as 
quantified  by  Hurvich  and  Jameson  [  ]4) . 

There  is  strong  evidence  that,  if  such  a  recoding  does  take  place, 
nevertheless,  only  3  independent  photopigments  are  involved,  i.e.,  n  »  3, 

The  key  to  the  argunent  is  the  finding  that  metameric  matches  are  not 
broken  down  by  moderate  adaptation  to  colored  lights.  If  b  -  c  for  normal 
adaptation,  then  the  appearance  of  both  b  anc  c  changes  after  adaptation  to 
colored  light,  but  except  for  extreme  adaptations,  b  and  c  still  match  in  color. 
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It  is  generally  assumed  that  the  effect  of  adaptation  involves 
(among  other  things)  bleaching  of  the  photopigments,  i.e.,  the  spectral 
absorptance  function  p^  is  multiplied  by  a  constant  t.,  where  1  -  t. 
represents  the  fraction  bleached.  Let  P  be  the  space  of  linear  functionals 
on  B  spanned  by  p^,  ....  p  .  Then  the  equations 

Tpi  =  tipi  1  =  l>  •  ‘ '  n 

define  a  linear  operator  T  of  P  onto  itself,  with  eigenvectors  p^.  Let 
Q  be  the  subspace  of  P  spanned  by  Dimensionality  considera¬ 

tions  show  that  Q  consists  of  all  the  linear  functionals  on  B  which 
vanish  on  M,  Since  adaptation  leaves  metameric  pairs  invariant,  b  *  c 
implies  that  Tq^(b)  *  Tq^(c)  for  i  *  1,  2,  3;  that  is,  if  b  •  c  t  M, 

then  TqjQi  -c)=0,  i  •»  1,  2,  3.  Hence,  Tq.^  e  Q,  i  *  1,  2,  3, 

and  it  follows  that  the  suDspace  Q  is  invariant  under  all  transformations 
of  form  T.  This  implies  that  Q  is  spanned  by  3  of  the  vectors 
Pj,  ...,  pn>*  Hence,  so  far  as  mediation  of  color  vision  is  concerned, 
there  are  exactly  3  independent  photopigments. 

Recently,  improved  spectrophotometric  techniques  have  yielded  direct 
evidence  that  there  are  3  photopigments  in  human  retinal  cones  (Rushton 
[29];  Marks,  Dobelle,  &  MacNichol  [26]).  The  question  of  whether  signifi¬ 
cant  recoding  of  the  3  photopigment  outputs  takes  place  is  still  a  key 

one  for  color  theory,  but  seems  not  to  be  decidable  on  the  basis  of 

color-matching  data  alone. 

* 

If  the  t^  are  distinct,  then  the  minimum  polynomial  of  T,  acting  in  Q, 

must  have  form  (x  -  t,  )(x  -  t,  )(x  -  t,  ).  Since  the  only  eigenvectors 

i  l  1 2  1 3 

associated  with  t.  are  multiples  of  p.  ,  it  follows  that  p.  ,p,  ,p ,  e  Q. 

U  M  i2  13 
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5,3  Color  appearance 

Tristimulus  coordinates,  determined  by  fixing  an  arbitrary  basis 
for  B/M,  convey  no  information  about  the  phenomenal  appearance  of  color. 
Much  of  the  literature  on  psychology  and  physiology  of  color  is  devoted 
to  establishing  a  preferred  (linear  or  curvilinear)  coordinate  system 
in  B/M,  in  terms  of  which  color  appearance,  including  color  discrimina- 
bility  and  similarity,  can  be  accounted  for,  and  to  characterizing  changes 
of  color  appearance,  correlated  with  changes  in  viewing  conditions,  as 
transformations  in  these  preferred  coordinates.  As  was  shown  in  the 
previous  section,  the  photopigments  provide  one  system  of  preferred 
coordinates,  but  the  possibility  of  recoding  needs  to  be  considered. 

The  most  popular  color  theory  involving  recoding  is  the  opponent- 
colors  theory  of  Hering.  This  was  quantified  by  Hurvich  and  Jameson  [14], 
It  is  based  on  the  observation  that  the  qualities  of  color  appearance 
can  be  grouped  into  3  pairs,  red-green,  yellow-blue,  and  white-black. 

Any  color  partakes  of  at  most  one  quality  from  each  pair.  The  pairs  are 
opponent,  in  the  sense  chat  additive  mixture  prodices  cancellation:  if 
b  looks  red  and  c  looks  green,  then,  under  the  same  viewing  conditions, 
b  ♦  c  looks  either  less  red  than  b  or  less  green  than  c  or  neither  red 
nor  green.  This  suggests  that  the  three  recoded  outputs, 
the  previous  section,  consist  of  a  red-green  output  [ q ^  fb )  >0  if  b  looks 
red,  <0  if  b  looks  green],  a  yellow-blue  output,  and  a  white-black  out¬ 
put.  These  outputs  depend  on  the  photopigment  absorptions, 


as  indicated  in  the  linear  equations  of  the  previous  section,  but  also 
on  other  aspects  of  viewing  conditions,  particularly,  the  presence  of 
other  stimuli  in  adjacent  parts  of  the  visual  field. 

Quantitative  versions  of  opponent- colors  theory  give  a  good  account 
not  only  of  the  basic  opponent -col or  qualities,  but  of  many  other  aspects 
of  color  appearance.  For  example.  Euclidean  distances  calculated  in 
terms  of  differences  in  the  red-green  and  yellow-blue  coordinates, 
give  a  reasonably  good  account  of  color  similarity  for  monochromatic 
stimuli  of  constant  brightness  (Krantz  [16]).  This  example  shows  how 
color  measurement  makes  contact  with  some  of  the  more  general  psychologi¬ 
cal  measurement  ideas  presented  in  earlier  lectures,  in  particular,  with 
problems  of  measurement  of  similarity.  Color  measurement  also  makes 
contact  with  polynomial  measurement;  for  example,  Hurvich  and  Jameson 
postulated  interrelations  among  some  of  the  subjective  qualities  of  color 
that  follow  polynomial  combination  rules. 

The  facts  of  color  appearance,  especially  the  existence  of  opponent 
pairs  of  qualities  such  as  red-green,  etc.,  and  the  usefulness  of  opponent- 
pairs  as  explanations  for  other  color  phenomena,  make  it  relatively  certain 
that  opponent-color  recoding  is  the  proper  basis  for  color  theory, 
although  many  quantitative  details  remain  to  be  worked  out.  Clearcut 
physiological  evidence  for  opponent- color  recoding  has  been  obtained 
from  microelectrode  recording  of  neural  activity  in  the  monkey's  visual 
system,  by  DeValois  and  his  coworkers  [10].  The  exact  physiological 
mechanisms  remain  a  mystery. 
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In  concluding  the  study  of  color  theory  and  color  appearance,  I 
should  like  to  touch  on  the  problem  of  dependence  of  color  appearance 
on  viewing  conditions.  This  dependence  is  quite  dramatic.  For  instance, 
a  stimulus  that  looks  reddish-yellow  under  "normal"  conditions  may 
look  yellow-green  after  exposing  the  eye  io  a  red  adapting  light,  or 
in  the  presence  of  a  bright  red  stimulus  simultaneously  shown  in  an 
adjacent  part  of  the  visual  field.  The  most  precise  tool  for  studying 
these  changes  is  cr\>e  8^  context  matching.  If  we  denote  different  spatio- 
temporal  contexts  by  o,t  etc.,  and  denote  by  b°  stimulus  b  viewed  in 
context  o,  then  we  can  define  b°  -  cT  to  mean  that  stimulus  b,  viewed 
in  context  o,  has  the  same  color  as  stimulus  c,  viewed  in  context  t.  With 
appropriate  experimental  methods,  -  can  be  regarded  as  an  equivalence 
relation.  b°  -  c°  means  that  b  is  metameric  to  c,  and,  as  indicated 
earlier,  it  has  been  found  that  b°  -  ca  implies  bT  -  cT  for  a  wide 
range  of  contexts  t. 

A  pair  of  contexts,  o,t  defines  a  function:  fQ  ^(c)  •  b 
iff  b°  -  cT.  This  is  defined  for  every  c  e  C  whose  appearance,  in 
context  t,  can  be  matched  by  the  appearance  of  some  stimulus  in  context 
o.  Also,  fo  T  is  well-defined  on  the  corresponding  subset  of  B/M, 
and  its  range  can  be  considered  a  subset  of  B/M.  The  study  of  context 
effects  on  color  appearance  is  thus  reduced  to  study  of  the  properties 
of  vector- valued  functions  of  vectors,  of  form  f 

a,  t 

In  the  first  section  above,  it  was  pointed  out  that  one  effect  of 
exposure  to  an  adapting  light  is  likely  to  be  bleaching  of  photopigments, 
Pj  -*•  t^pj.  If  we  postulate  that  equal  appearance  corresponds  to  equal 

m*-*  ** 
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photopigment  outputs,  then  the  corresponding  functions  f  representing 

0»T 

changes  in  this  effect  of  adaptation  Bust  fee  all  representable  by  diagonal 
matrices  in  the  coordinates  corresponding  to  the  dual  basis  of  the  p^. 

This  is  called  the  von  Kries  coefficient  law  [27].  During  the  1950's, 
several  unsuccessful  nttempts  were  made  to  use  empirical  determinations 
of  the  functions  f^  t  to  determine  the  required  coordinate  system,  and 
hence,  to  find  the  photopigment  absorptance  curves  ([6], [25]).  However, 
the  functions  f  seem  much  better  represented  by  a  linear  transformation 

0  ,  T 

plus  a  translation  than  by  linear  transformations  alone.  A  paper  by 
Krantz  [19]  provides  a  general  theoretical  framework  for  study  of  functions 
defined  by  cross -context  matching,  and  yields  the  prediction  of  linear 
transformation-plus-translation  as  a  special  case  in  B/M. 
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The  modern,  etored  program  digital  computer  is  about  twenty  years  old.  Its 
advent  and  the  remarkable  improvements  in  its  components  and  circuitry  have  ere: '.cd 
a  deep  interest  in  the  theory  of  Automata  and  the  methodology  of  problem  formu¬ 
lation  and  solution  by  such  devices.  Out  of  the  study  of  computers  and  the  study 
of  new  formulations  of  various  problems  there  her.  emerged  the  field  of  Computer 
Science,  which  includes  (but  is  not  limited  to)  such  sub-fields  as  computer  cir¬ 
cuitry,  machine  organization  and  logical  design,  numerical  analysis,  theory  of 
programming,  theory  of  automata  and  switching  theory. 

Computer  Science  i3  closely  related  to  mathematics;  indeed,  numerical  analit.ut 
Is  a  branch  of  mathematics  while  many  of  the  problems  arising  from  the  design  and 
use  of  computers  are  intimately  associated  with  questions  in  combinatorial  rnthe- 
matics,  abstract  algebra  and  symbolic  logic.  But  an  even  more  fundamental  rela¬ 
tionship  also  exists.  The  inherent  structure  of  a  computer  forces  one  who  sucess- 
ful.ly  studies  or  uses  it  to  strive  for  the  type  of  generality,  abstraction  and 
close  attention  to  logical,  uetail  that  is  characteristic  of  mathematics1  argument e . 

A  brief  review  of  the  logically  distinct  units  of  p.  computer  system  is  useful 
in  understanding  the  nature  and  the  growth  of  various  sub-fields  t.f  Computer  Selene 
Inasmuch  C3  ouch  a  device  i3  a  general -purpr§e  one  and  automatic  (i,e.  independent 
of  the  human  operator  after  the  solution  of  a  problem  starts)  it  should  contain 
certain  main  organs:  (l)  a  central  processing  unit  (an  arithmetic  unit),  (?)  a 
memory,  (3)  a  control  unit  and  (4)  input-output  devices. 

The  function  of  the  central  processing  unit  (CFU)  is  to  perform  various  opera¬ 
tions  on  arrays  of  bits  contained  in  o.t»  These  arrays  of  bite  are  usually  called 
words  end  may  represent  numbers  or  non-numerical  data  drawn  from  many  diverse  fiei< 
The  crucial  point  is  that  with  modern  electronic  techniques,  the  CPU  can  perform 
its  operations  with  fantastic  speeds,  10^  to  10^  operations  per  second  for  some  CIV 


2 


and  softs.’  operations.  Thus  if  the  computer  is  to  be  used  in  a  manner  conmensurate 
with  its  abilities,  it  must  have  available  to  it  at  speeds  comparable  to  its  exe¬ 
cution  speeds  a  sequence  of  operators  and  operands. 

Such  sequences  are  Kept  in  the  memory  of  the  computer. 

there  oust  be  an  organ  in  the  computer  which  will  automatically  call  forth  the 
various  operators  and  operands  and  execute  the  former.  This  device  is  called  the 
control  of  the  computer.  Hie  list  of  distinct  operations  which  the  control  can 
execute  is  called  the  code  of  the  computer. 

these  three  units  are  internal  to  the  computer  itself.  There  must  also  exist 
devices,  the  input-output  organ,  whereby  the  human  operator  and  the  machine  can 
conmunicate  with  each  other. 

Hie  design  and  construction  of  devices  with  the  properties  listed  above  has 
stimulated  great  interest  in  physical  phenomena  which  may  be  exploited  to  construct 
components  for  computers.  Thus  many  computer  laboratories  have  studied  and  are 
continuing  to  study  the  possibility  of  using  lasers  and  masers  as  computer  memories. 
The  possibility  of  using  cyrogenic  techniques  in  the  construction  of  computers  has 
also  been  under  intensive  examination. 

The  design  and  construction  of  computers  has  also  stimulated  a  'great  deal  of 
theoretical  work  ranging  from  the  "practical"  aspects  of  switching  theory  to  the 
theory  of  automata.  I  would  characterize  the  first  limit  mentioned  above  as  being 
concerned  with  problems  such  as  the  following:  What  is  the  minimum  number  of  ele¬ 
ments  with  the  ability  to  perform  prescribed  elementary  logical  operations  (for 
example  to  accept  as  input  two  logical  variables  x  and  y  and  to  have  as  an  output 
the  single  logics'  variable  "x  and  y")  needed  to  construct  a  device  capable  of  forming 
a  given  Boolean  function  of  n  Boolean  variables. 

Automata  theory  may  be  said  to  have  originated  with  Turing's  work  ^  on  the 
general  definition  of  what  is  meant  by  a  computing  automaton.  Present  day  prob¬ 
lems  in  this  logical  -  mathematical  theory  are  concerned  with  abstract  models  of 
devices  whose  behavior  at  &  particular  instant  of  time  depend  not  only  on  the  pre- 


r;ent  input  to  the  machine,  but  generally  on  the  entire  past  history  of  the  device 

including  past  inputs.  In  the  main  it  is  essentially  a  chapter  in  formal  logic 

[2] 

and  as  such  has  the  property  characterized  by  von  Neumann  of  being  "cut  off 
from  the  best  cultivated  portions  of  mathematics,  and  forced  onto  the  most  diffi¬ 
cult  part  of  the  mathematical  terrain,  into  combinatorics". 

In  the  paper  from  which  the  above  flotation  v;as  taken,  von  Neumann  predicted 
that  automata  theory  would  differ  from  formal  logic  in  tv/o  respects:  "(l.)  The 
actual  .length  of  'chains  of  reasoning',  that  is,  the  chains  of  operations,  will 
have  to  he  considered  and  (2)  The  operations  of  1 orie. . , .will  have  to  be  treated 
by  procedures  which  allow  exceptions  with  low  but  non-zero  probabilities".  He 
expected  that  the  need  to  take  account  of  point  (?)  would  bring  the  theory  of  auto¬ 
mata  closer  to  analysis  than  to  combinatorics.  This  has  not  happened  as  yet. 
Current  research  in  Automata  theory  is  concerned  with  problems  arising  in  connec¬ 
tion  with  the  use  of  computers,  in  contrast  to  the  design  of  computers.  Thus 
many  workers  in  the  field  are  studying  among  other  topics,  the  theory  of  program¬ 
ming  languages,  algebraic  coding  theory  and  pattern  recognition. 

When  computers  are  used  in  problem  "solving"  they  must  be  furnished  with 
programs:  lists  of  instructions  which  must  be  scanned  by  the  control  and  executed. 
If  this  were  .just  a  linear  scanning  of  the  secuer.ee  of  instructions  which  remain 
unchanged  in  form,  then  matters  would  be  simple.  Programming  a  problem  for  the 
machine  would  consist  in  translating  a  meaningful  text  from  one  language  (for 
example,  the  language  of  mathematics  in  which  the  planner  will  have  conceived  the 
problem)  into  another  language  (that  one  of  the  code  of  the  computer). 

This  is  not  the  case.  Because  computers  execute  orders  with  greav.  spced6, 
it  is  necessary  to  use  iterutive  and  inductive  algorithms  for  problem  solving. 

Then  the  relation  between  the  program  to  the  mathematically  conceived  procedure 
of  solution  is  not  a  statical  one,  that  of  translation,  but  highly  dynamical: 

A  program  stands  not  simply  for  its  contents  at  a  given  time  in  reference  to  a 
given  set  of  memory  locations,  but  more  fully  for  any  succession  of  passages  of 
the  control  through  it  with  any  succession  of  modified  contents  to  be  found  by  the 
control  there;  ell  of  this  being  determined  by  all  other  orders  of  the  sequence 


k. 

of  instructions  (in  conjunction  with  that  instruction  nw  "being  executed  by  the 
control). 

The  theory  of  programming  is  in  part  concerned  with  techniques  for  providing 

a  dynamic  background  to  control  the  automatic  evolution  of  a  meaning,  von  Neumann 
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and  Goidstine1  who  so  characterized  programming  viewed  that  subject  as  a  new 
branch  of  formal  logics  and  indicated  methods  for  mastering  various  parts  of  it. 
Their  work  and  that  of  a  large  body  of  subsequent  workers  has  been  devoted  to  tech¬ 
niques  involved  in  preparing  problems  for  solution  by  a  computer. 

Problem  oriented  programming  languages  such  a,.  ALGOL,  FORTRAN  and  a  host  of 
others  have  been  devised.  In  addition  special  languages  for  dealing  with  special 
types  of  problems  such  as  COBOL,  SNOBOL  and  LISP  have  also  been  created. 

The  existence  of  these  languages  and  the  compilers  they  require  an  well  as 
the  desire  to  have  efficient  use  of  computers  has  led  to  the  introduction  of  opera¬ 
ting  programs,  monitoring  systems  and  executive  systems.  That  is,  programs  that 
oversee  the  use  of  the  computing  equipment.  TTi 5 r>  development  has  led  to  the  crea¬ 
tion  of  a  special  class  of  programmers  who  are  called  system  programmers  and  who 
have  created  a  branch  of  programming  called  s -/stems  programming. 

Although  the  original  impetus  for  developing  modern  computers  arose  from  the 
need  for  better,  bigger  and  faster  "ariUimetic  engines’',  computers  are  now  being 
increasingly  used  for  such  non-numarieal  tasks  as  the  simulation  of  various  complex 
systems,  the  solution  of  problems  involving  only  complicated  logical  operations 
and  even  the  design  of  new  computers  and  computer  systems.  The  above  list  of  use3 
is  by  no  means  exhaustive  and  many  of  the  uses  have  grown  into  .ophisticated  bodies 
of  knowledge  with  an  associated  theory  that  is  given  a  label  as  u  sub-field  of 
Computer  Science. 


Since  other  lectures  will  cover  some  of  these  fields  in  detail  I  ohall  con¬ 
fine  the  remainder  of  my  time  to  problcsms  arising  when  one  attempts  to  understand 
the  errors  involved  when  one  uses  a  modern  computer  to  obtain  numerical  solutions 
of  mathematically  formulated  underlying  physical  or  engineering  problems.  J.  von 


rieuninnn  and  II. It.  Goidstine 


P<) 


have  pointed  out  the  following  ;  >-ir  source:; 
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errors  in  obtaining  numerical  solutions  of  an  underlying  physical  or  engineering 
problem: 

I  ERRORS  OF  FORMULATION.  These  arise  due  to  the  fact  that  the  mathenatical 
formulation  of  an  underlying  physical  or  engineering  problem  represents  such  a 
problem  only  with  certain  idealisations,  simplifications  and  neglections.  In 
other  words  the  problem  that  is  actually  formulated  to  be  solved  mathematically, 
the  problem  that  is  called  the  rigorous  mathematical  problem,  is  in  itself  an 
approximation  to  some  other  problem. 

There  are  many  examples  of  these  approximations.  Thus  in  the  theory  of 
fluid  mechanics  the  notion  of  a  perfect  fluid,  one  without  viscosity  and  without 
heat  conductivity,  is  an  approximate  representation  of  a  real  compressible  fluid. 

In  certain  circumstances  it  represents  the  behavior  of  a  compressible  fluid  ade¬ 
quately  end  its  importance  arises  from  this  fact.  However  in  other  cases  it  is 
completely  inadequate  to  deal  with  the  physicul  problem  involved  and  heat  conduo> 
tivity  and  viscosity  have  to  be  admitted  into  the  theory.  Even  the  theory  of  a 
viscous  heat  conducting  compressible  fluid,  which  i6  much  more  involved  than  that 
of  perfect  fluid,  is  inadequate  for  some  problems.  These  problems  require  the 
replacement  of  the  representation  of  a  fluid  as  a  continuum  by  the  representation 
of  a  fluid  as  a  collection  of  molecules,  as  is  done  in  the  kinetic  theory  of 
gases.  Ihus  in  this  one  field  we  find  that  successively  finer  representations  of 
the  physical  problem  have  to  b«  made  and  that  the  mathematical  tools  needed  for 
making  these  refinements  are  quite  different  in  character. 

It  is  the  relationship  between  the  errors  of  formulation  end  those  listed 
below  that  I  want  to  stress  today, 

II  ERRORS  IN  VALUED  OF  P/dtlWETERq.  The  nwthemetical  formulation  of  a  problem, 
with  the  idealisations  discussed  under  I,  may  involve  parameters  whose  values  have 
to  be  inserted  in  a  numerice.l  computation.  However,  these  parameters  any  net  be 
known  with  sufficient  accuracy  and  tlus  will  introduce  errors  which  may  then  "infect" 
the  solution. 
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The  discussion  of  the  effects  of  this  source  of  errors  lead--  to  the  following 
n»then»tical  problem:  Are  the  solutions  of  the  rigorouB  mathematical  problem  (the 
approximation  to  the  underlying  problem)  continuous  functions  of  the  parameter? 
Depending  on  the  answer  to  this  question  the  importance  of  the  errors  of  this 
category  can  be  assessed. 

Ill  ERRORS  OF  TRUNCATION.  The  mathematical  problem  described  under  I  may 
involve  transcendental  functions  and  operations  which  will  have  to  be  evaluated 
by  use  of  a  finite  sequence  of  elementary  arithmetic  operations.  For  exanple, 
the  exponential  function  and  the  trignometric  functions  are  usually  evaluated 
by  the  use  of  polynomial  or  rational  function  approximations  and  intergrals  and 
derivatives  are  evaluated  by  quadrature  formulas  and  finite  difference  approxi¬ 
mations  respectively. 

Thus  the  rigorous  mathematical  problem  which  is  an  approximation  to  an  under¬ 
lying  problem  is  further  approximated  for  computational  reasons.  The  interaction 
between  these  two  approximations  is  not  sufficiently  stressed  by  the  people  who 
prepare  problems  for  and  make  use  of  computers  to  obtain  numerical  solutions  of 
problems. 

The  approximation  of  one  problem  b;,  another  raises  two  mathematical  questions 
of  classical  numerical  analysis:  (l)  the  question  of  convergence  and  (2)  the 
question  of  stability.  These  questions  car.  be  illustrated  by  considering  a  problem 
involving  ordinary  or  partial  differential  equations. 

When  the  differential  equation  is  replaced  by  a  finite  difference  equation 
a  mesh  is  introduced  over  the  independent  variables.  That  is,  only  discrete  sets 
of  values  of  the  independent  variables  are  used  in  the  problem.  The  question 
arises  as  to  the  behavior  of  the  solution  of  the  difference  equations  at»  the  number 
of  elements  in  this  set  Increases.  One  must  search  for  those  discrete  formulations 
of  the  problem  which  have  the  property  ♦hat  their  solution  converges  to  the  solu¬ 
tion  of  the  transcendental  problem  at?  the  number  of  elements  in  the  discrete  set 
becomes  infinite. 
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Hie  stability  problem  involves  the  question  as  to  whether  the  solution  of 
the  differential  equations  and  the  approximating  difference  equations  are  con¬ 
tinuous  functions  of  the  initial  end  boundary  conditions.  Ihia  question  ia  of 
importance  in  of  the  fact  that  because  of  round-off  errors,  which  will  be 
discussed  below,  it  is  impossible  to  satisfy  these  conditions  exactly.  Thus 
one  needs  some  assurance  that  the  errors  introduced  from  this  source  will  not 
amplify  but  will  indeed  decrease  in  importance,  that  is,  will  damp  out. 

IV  ROUND-OFF  ERRORS.  These  errors  arise  because  the  "elementary  arithmetic 
operations"  of  a  computing  machine  are  not  rigorously  and  faultlessly  performed. 
Thus  in  r.  digital  machine  real  numbers  x  and  y  are  replaced  by  digital  approxi¬ 
mations  x  and  y  respectively.  1 he  quantities  x  ±  y  may  then  have  corresponding 
digital  representations,  that  is,  we  may  have 


TTTy)  x=  x  t  y  . 

However  as  far  as  products  and  quotients  are  concerned,  these  are  governed  by  the 
relations 

Txyl  «  x  y  ►  rs*^  *  x  y  +• 

713  ■  V-  •  n.<1>  ■  %  *  „U> 

where  n  ^  (and  n^)  and  a  ^  (and  n  are  the  rourd-off  errors  of 
multiplication  and  division  respectively. 

Round-off  errors  are  as  old  as  the  art  of  computation  Itself.  If  multi¬ 
plications  and  d. visions  were  not  rounded -off  even  elementary  computations  would 
soon  lead  to  the  use  of  reams  of  paper.  Kovevcr,  with  the  advent  of  high-speed 
coaputers  the  great  numbers  of  such  arithmetic  operations  become  possible  and 
one  sjust  consider  the  effect  these  errors  have  on  the  validity  of  th'*  results  obtained. 


8. 

There  is  as  yet  no  general  theorem  governing  the  effect  of  round-off  errors 
and  every  problem  formulated  for  machine  solution  has  to  be  analysed  separately 
to  obtain  an  estimate  of  what  the  round-off  error  is  and  vhat  importance  it  has 
for  the  problem  at  hand. 

The  problems  with  which  the  theory  of  round-off  and  truncation  errors  should 
be  concerned  are  closely  related  to  the  problem  posed  by  Kronecker  when  he  in¬ 
sisted  that  mathematics  be  formulated  in  a  manner  which  involved  finite  construc¬ 
tions.  When  we  formulate  problems  for  solution  by  computing  machines  we  are 
attempting  to  follow  Kronecker1 s  dictum  with  at  least  one  important  modification: 
namely,  arithmetic  is  not  being  carried  out  faultlessly.  The  saving  grace,  if 
any,  being  that  the  errors  committed  in  the  arithmetic  processes  are  known. 

The  formulation  of  such  a  theory  presents  a  challenge  to  pure  and  applied 
mathematicians  as  deep  and  as  important  as  any  problems  presently  being  dealt 
with.  Unfortunately  many  people  are  unaware  of  this  challenge.  Many  mathema¬ 
ticians  seem  to  feel  that  problems  connected  with  computation  are  concerned 
solely  with  arithmetic,  a  subject  whose  avoidance  was  partly  responsible  for 
their  becoming  interested  in  mathematics,  and  therefore  such  problems  are  to  be 
shunned. 

I  now  wish  to  turn  to  the  discussion  of  the  interaction  between  the  errors 
of  formulation  and  the  errors  o'’  truncation.  My  thesis  is  that  such  an  interaction 
exists  and  that  it  may  be  profitably  exploited  to  obtain  easier  and  more  correct 
computing  algorithms  for  dealing  with  various  problems.  I  shall  illustrate  this 
thesis  by  examples.  Before  doing  this  I  should  point  out  that  the  existence  of 
present  day  (and  future)  computing  machines  make  (and  will  make)  the  exploitation 
of  this  interaction  possible  in  some  cases  where  without  such  machines  it  is  not 
feasible  to  do  anything  very  different  from  what  has  been  done  in  the  past. 

I  shall  first  discuss  what  may  be  considered  a  ridiculous  example  but  what 
I  hope  will  not  long  be  a  ridiculous  one.  This  example  is  concerned  with  a  method 
for  dealing  with  problems  in  the  theory  of  fluid  dynamics.  That  theory  deals  with 
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a  continuum  over  which  there  are  various  fields  defined:  the  velocity  field, 
the  pressure  field  and  the  density  field.  The  physicist  considers  the  fluid  as 
a  collection  of  molecules  moving  about  in  space  in  accordance  with  certain  laws 
of  motion  and  introduces  these  various  fields  in  terms  of  averages  of  other  quan¬ 
tities  defined  for  the  mo'.ocules  themselves. 

When  a  problem  in  fLuid  dynamics  is  formulated  in  terms  of  a  continuum  and 
fields  defined  over  thin  continuum  which  satisfy  certain  differential  equations 
and  when  these  equations  are  replaced  by  finite  difference  equations  ve  are  in 
effect  replacing  the  iluid  by  a  discrete  collection  of  particles.  One  should 
then  a6k  the  question  as  to  what  relation  this  collection  of  particles  has  with 
the  physicist's  molecules.  Indeed  one  should  ask  why  go  throuefr  this  chain  of 
approximations  at  e.UT  Can  we  not  deal  with  the  physicist's  molecules  directly 
and  won't  this  give  more  significant  results  than  the  former  procedure? 

The  answer  to  the  second  question  seems  to  be  that  with  existing  computers 

wo  cannot  keep  track  of  enough  molecules  for  u  long  enough  time  to  make  it  into 

a  feasible  method  for  hand line  fluid  dynamics  problems.  However  this  answer  needs 

to  be  looked  at  closely.  C  r, .  reason  for  this  is  that  no  sharp  estimates  are  at 

hand  aG  to  when  the  law  of  large  numbers  becomes  operative;  will  an  assembly  of 

23 

thousands  of  molecules  behave  essentially  the  same  as  10  molecules  as  far  as 

23 

the  central  limit  theorem  of  probability  is  concerned  or  are  10  molecules  really 
needed? 

The  work  of  Nord sleek  and  Hicks ^  on  the  application  of  Monte  Carlo  tech¬ 
niques  to  the  solution  of  the  Boltzmann  equation  shows  that  computers  may  be 
effectively  used  in  obtaining  the  molecular  velocity  distribution  under  condi¬ 
tions  far  from  equilibrium.  Hence  computers  enable  one  to  deal  with  many  fluid 
dynamical  problems  in  a  quite  novel  manner. 

There  are  many  instances  in  which  discrete  problems  sure  approximated  by  dif¬ 
ferential  equations  which  in  turn  are  approximated  by  difference  equations  and 
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then  solved  by  use  of  computing  instruments.  Thus  errors  of  formulation  and  trun¬ 
cation  are  needlessly  introduced,  for  vith  the  advent  of  modern  high- speed  computers 
the  original  discrete  problem  could  be  handled  directly. 

Many  boundary  valued  differential  equation  problems  originate  from  variational 
principles.  This  fact  can  be  used  to  obtain  simpler  and  easily  solved  discrete 
approximations  to  the  defining  equations  for  the  unknown  functions.  This  is 
another  illustration  cf  the  fact  that  the  interaction  between  the  formulation 
process  and  the  truncation  process  may  be  exploited  to  obtain  more  meaningful  and 
mar®  accurate  approximations  to  problems  posed  for  numerical  solution  by  computers. 

I  shall  not  discuss  this  point  further  but  refer  you  to  a  paper  where  this  ap¬ 
proach  has  been  used  on  Sturm-Liouville  differential  equations. 

I  shall  devote  the  remainder  of  my  remarks  to  the  interaction  of  round-off 
errors  and  truncation  errorr.  Consider  the  problem  of  solving  the  equation 

x  =  G  (x)  (1) 

or  a  computer.  We  shall  assume  that  x  is  a  real  scalar  variable  and  G  (x)  is  a 
function  such  that  there  exists  real  numbers  r  and  b  for  which 

|G(x)  -  r|  b  jx  -  r| 

where 

0  <_  b  <  1 

This  condition  insures  the  convergence  to  r  of  the  sequence  of  xn's  defined  by 

xn+l  -  0  <V  n  ‘  °>1- . 

When  one  wishes  to  find  the  numbers  x  satisfying  equation  (l)  one  may  attenmt 
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to  do  this  by  generating  on  a  computer  the  sequence  of  x  *s  satisfying  equa¬ 
tion  (2).  However  because  of  truncation  errors  one  will  attempt  to  solve  an 
equation  of  the  form 


v„  ♦  i  -  H  <V 


0,1. « 9 


(3) 


where 


H  (x)  -  G  (x)  +  (T  (x)  (4) 

and 

|C  (x) |  <  a  (5) 

a  being  a  constant.  The  function  H  (x)  i6  some  algebraic  approximation  to  the 
function  G  (x)  which  in  turn  may  be  a  transcendental  function.  The  function 
is  the  truncation  error  and  a  is  a  bound  for  it. 

r  7i 

Descloux  has  shown  that  for  any  Vq  the  sequence  defined  by  equation 
(3)  is  bounded  and  all  its  points  of  accumulation  V  satisfy  the  inequality 

IV  -  r|  <  .  - 

I  !  _rrT 


He  has  further  shown  that  the  scheme  given  by  equation  (3)  is  the  best  possible 
one  for  solving  equation  (2)  in  the  following  sense:  for  given  a  and  b  there  exists 
a  function  h(x)  for  which  it  is  impossible  to  find  an  algorithm  using  only  H,  a 
and  b  providing  closer  points  of  accumulation  to  r  than  the  algorithm  (3). 

Because  of  round-off  errors  a  computer  will  not  generate  the  sequence  Vn«  If 
the  computer  uses  fixed  point  arithmetic  it  will  generate  a  sequence  of  Integers 


'n  +  1 


1  0  (jrn>  *  s  A 
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.  where  [  ]R  1§  celled  a  rounding  procedure  and  [x]Ris  any  integer-valued  function 
of  x  satisfying  the  Inequality 


I  tV  -  xj  <  1. 

Ifce  normal  rounding  procedure  is  defined  as 

(x)N  -  [x  +  0.5]  . 

This  is  the  procedure  that  is  usually  used  in  hand  computations  and  is  Incorporated 
in  many  computers.  Descloux  showed  that  for  any  yQ  the  sequence  of  integers 
defined  by 

\  *  1  -  tG  <yn)  *  'Mu 


there  exists  an  N  such  that 


1 

2  (l  -~V5 


(6) 


for 


n  >  N; 

furthermore  for  given  a  and  b,  there  exists  a  function  G  and  errors  £  for 
which  the  bound  is  attained. 

Equation  (6)  then  tells  us  what  accuracy  can  be  ejected  in  solving  equation 
(l)  by  the  iterative  scheme  (2)  when  given  truncation  and  round-off  errors  are 
Introduced.  He  has  obtained  the  corresponding  result  when  floating  point  arith¬ 
metic  is  used.  I  shall  not  give  it  here.  Die  point  I  wish  to  stress  is  that  con¬ 
vergence  may  be  in  error  by  a  significant  ameunt. 
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Let  us  consider  an  example.  Suppose  we  wish  to  solve  the  equation 


x 


The  problem  is  a  trivial  one  but  let  us  suppose  that  we  do  not  know  that  the  solu¬ 
tion  is  x  *  0  but  attempt  to  solve  the  equation  by  generating  the  sequence 


If  we  use  fixed  point  arithmetic  with  the  decimal  or  binary  point  at  the  right  of 
the  registers  we  shall  generate  the  sequence  of  integers 

yn  +  1  “  ^  yn  +  21  r  • 


Thus  starting  with  yQ  •  8,  we  obtain  y^^  •  7,  yg  ■  6,  y^  ■  5,  y^  ■  U,  y^  •  k,  yQ  -  h 
n  £  5*  Thus  the  binary  machine  solution  will  be  4  x  2  p  where  p  is  the  number  of 
binary  bits  in  the  machine  representation  of  numbers.  Of  course  If  p  is  large 
this  error  is  tolerable.  On  the  other  hand  the  result 

x  ■  h  x  2"^ 


is  due  to  the  fact  that 

1  1 

h  ■  *  ■  "  ■  “  "w 1 

2  (1  -  b)  2  (1  -  J) 

Hence  if  b  is  close  to  one,  the  error  may  very  well  be  Intolerable. 

Equation  (6)  furnishes  one  with  a  basis  for  evaluating  different  truncating 
schemes.  Thus  if  )a|  «  i  for  each  of  two  truncating  schemes,  the  error  in  the 
result  will  be  mainly  determined  by  the  round-off  and  hence  there  is  no  reason  for 
using  the  more  elaborate  truncation  procedure. 
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The  question  as  to  Aether  round-off  procedures  can  be  devised  which  lead  to 
better  results  has  also  been  considered  by  Descloux.  He  extended  gome  results  of 
A*  ltOrdsleck  and  shoved  that  there  exists  a  round-off  procedure  such  that  the  yQ 
satisfying 


*  1  ”  +  f0  <*»>  *  A 


are  such  that  for  any  there  exists  an  H  such  that 


|y  «  -  r|<  -'ft—  +  1 

•  *  35  +1  1  1  -  b 


for 


n  >  H. 

That  is  the  round-off  error  can  be  reduced  to  one  bit  in  the  last  place.  The 
round-off  procedure  is  called  Anomalous  rounding  for  it  violates  one's  intuition. 
Its  rules  are 


for  | x |  <_  1  |[xJAl  >  x 

for  |x|  ^  1  |  [x)A!  <_  x  . 


Thus  when  x  is  snail  enough  one  rounds  up,  that  is,  increases  the  errcr  due  to 
rounding.  Ho  one  has  yet  been  able  to  extend  these  results  to  the  case  of  vector 
problems.  That  is,  to  the  case  vhere  x  is  a  vector  and  G  (x)  Is  a  vector 
valued  function.  Thus  there  remains  an  open  important  problem  in  understanding 
the  interaction  between  round-off  errors  and  truncation  errors. 
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TWO  DIMJTiMONAL  VISUAL  GEOMETRY 
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We  Bliali  try  to  describe  a  geometry  base  a  exclusively  os  the 
concept  "from  my  position  I  see  an  object  A  to  the  left  of  object  1", 
Two  dimensional  rnean^  1.  ^ kuat  an  observer  and  all  observed  objects 
are  on  the  two  dimensional  Euclidean  plane. 

For  the  objects  we  shall  take  narked  points  on  the  Euclidean 
plane  (intuitively,  they  are  visible  and  individually  distinguishable 
objects).  The  observer  sill  be  characterised  by  his  angle  of  vision 
wliich  can  vary  fros  0  to  360  degrees.  The  observer's  position  on  the 
plane  is  characterized  by  a  point  (the  point  where  he  is  standing) 
and  the  angle  equal  to  his  visual  angle  with  the  vertex  in  the  point, 
(the  angle  covers  the  area  he  is  observing).  The  observation  fros  such 
a  position  is  the  permutation  (sequence)  of  narked  points  which  are 
within  his  visual  angle  listed  in  order  fros  left  to  right 


•C 


In  the  first  picture  the  observer  has  a  visual  angle  of  degrees 
Fros  hia  position  sarkeo  by  0,  he  made  the  observation  (A,  A.  Cj, 

(f><  sees  a  to  the  left  of  d,  and  fi  to  the  left  of  C).  Is  the  second 


I 


2 


picture  the  observer’s  visual  angle  is  90  degrees  and  hie  obsc-vation 
is  [a,  D,  CJ,  Ju  is  not  in  his  field  of  vision. 

Wt*ark ,  shall  not  consider  the  cases  in  which  the  observer' « 
position  is  in  a  straight  line  joining  two  visible  points  so  that  one  of 
the*  is  directly  in  front  of  another;  but  this  restriction  can  be 
easily  avoided. 

Let  be  giv--  a  set  of  observational  points  X  ana  a  net  of  marked 
points  Y  fro*  points  0  in  X  vill  be  called  the  description  of  Y.  liuch 
triplet  (X,  Y,  a)  will  be  called  a  visual  geometry,  as  the  concepts  of  such 
s  geometry  we  admit  only  those  which  can  be  defined  in  terms  of  the  description 
of  Y.  for  example,  in  the  geometry  where  X  is  the  whole  plana  and 
«  -2=  l80  degrees  we  can  express  the  fact  that  point  a  lies  within  the 
triangle  d,  b,  C  in  the  following  way:  in  no  observation  in  wnich 
*  a,  Bt  C,  D  are  visible  doea  a  occupy  outer-moat  left  or  outer-most 
right  position. 

In  the  raoe  in  wnich  Y  has  exactly  n  elements  there  are  only  finitely 

■any  possible  descriptions  of  the  uet  Y  (the  description  ie.a  set  of 

permutations  with  elements  in  Y).  In  some  ca.  es  more  exact  entia.ttionc  can 

be  given.  Kor  exam;  le,  when  X  is  a  whole  plane  and  <i  »  j60  decrees  the 

4 

number  of  dlffeicnt  descriptions  of  Y  h on  tne  magnitude  of  r.  . 

In  c..ue  enen  tne  *V  Y  consists  ol  <11  points  on  the  plane  (or 
any  dense  act  of  points)  tn<r  the  resulting  geometry  is  affine  geometry  In 
the  following  sense:  if  me  act  X  contain.')  at  lead  lour  non-coliinear 
points,  then  the  only  transfci w  . t ion  of  the  >1  r.e  into  ituelf  wnich 
prsherves  the  o.acrij  tion  <r*  nMine  tr  »r.ftfor»«tiono. 
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It  follows  froa  ths  lsst  theorem  that  If  vs  have  two  configurations 
of  points  which  art  not  equivalent  in  affins  geometry  as  can  always 
add  such  finits  set  of  points  to  sach  configuration  so  that  they  will 
not  bs  equivalent  (will  havs  different  properties)  in  respective  finits 
geometries* 

This  shows  that  better  approximations  of  affine  concepts  can  be 
achieved  by  extending  set  Y  of  visible  points  and  not  set  X  of 
observation  points,  what  can  be  called  a  type  of  context  sensitivity* 
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